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FOREWORD 


HIs volume organizes some new thinking and research on 
measurement and prediction in social psychology and sociology. 
It grew out of the work of the Research Branch of the Information 
and Education Division in the War Department in World War II. 

Back of the substantive findings of the Research Branch, reported 
in Volumes I and II of this series, stands a wide variety of method- 
ological research. Selected contributions are described in detail in 
Volume ITI and in the present volume. 

In Volume ITI, methodological problems involved in the use of 
controlled experiments are analyzed in some detail, especially as 
they apply to the field of mass communication. 

In Volume IV the first eleven chapters deal with a theoretical and 
empirical analysis of problems of measurement—particularly the 
development of models of ordered structures or scales and practical 
procedures for testing their adaptability to a given area of socio- 
psychological content. Chapters 12 through 16 deal with problems 
of prediction as illustrated by two of the major efforts of prediction 
made by the Research Branch. 

Chapter 1 provides an introductory overview to the contributions 
to measurement and Chapter 12 performs the same office for the 
contributions to prediction. 

These reports leave out other fields of methodological endeavor in 
which the Research Branch was interested. To cite one example, 
efforts were made to improve methods of interviewing. Special 
attention was given to informal and unstructured or partly struc- 
tured interviews, as supplementary to the more standard question- 
naire techniques generally used in the Branch. Some of this experi- 
ence has been published elsewhere.! 

As the introductory chapter in Volume I of this series pointed out, 
the mission of the Branch was one which might be called an engi- 
neering rather than a scientific one. To carry out the practical 

See, for example, Robert K. Merton and Patricia L. Kendall, “The Focused Inter- 
view,” American Journal of Sociology, Vol. 51, No. 6 (May 1946), pp, 541-557. 
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engineering tasks, there was need for better research techniques. 
Work on these techniques was, of necessity, incidental to the main 
responsibility of getting reasonably accurate information quickly 
into the hands of those who needed the information for policy deci- 
sions. Hence, many of the social scientists, being keenly sensitized 
to the shortcomings of techniques in current use, were often frus- 
trated at not having more time to turn away from immediate prob- 
lems to intensive study of these methodological challenges. Never- 
theless, some progress was made, as will be seen. 

In Volume I, Chapter 1, the view was expressed that the future 
of social psychology and sociology calls for three developments: 

1. Formulation of theories, at least of some limited generality, 
which can be operationally stated such that verification is possible, 
and from which predictions can be made successfully to new specific 
instances. 

2. Such theories demand that the objects of study be isolated and 
accurately described, preferably by measurement. 

3. Once the variables are identified, the test of adequacy of the 
theory, in comparison with alternative theories, must be rigorous, 
preferably evidenced by controlled experiment, and preferably repli- 
cated. 

By its contributions to measurement and prediction this volume 
seeks to accelerate the advance of social science. 

This is not a textbook. Neither is it a comprehensive treatise. 
Rather, it organizes, as compactly as possible, a considerable amount 
of fresh thinking on these problems, with copious illustrations from 
Research Branch data. 

While the basic concepts on which this volume is based developed 
in large part in response to need for methodological improvement 
of practical research operations in wartime, thinking has not stood 
still since the war. This volume incorporates subsequent reflection 
and research, although all empirical data are from Research Branch 
materials. In the measurement field, for example, Louis Guttman 
has extended and generalized the approaches which he initiated for 
use in the War Department, and Paul F. Lazarsfeld, whose work 
began in the closing stages of the war, has developed most of his sys- 
tematic theory subsequently. Completion of this volume has been 
made possible by a grant to the Social Science Research Council by 
the Carnegie Corporation and by assistance from other sources. 
The Harvard University Laboratory of Social Relations and the 
Columbia University Bureau of Applied Social Research have con- 
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tributed facilities and funds. In 1948-1949 the RAND Corpora- 
tion under Air Force Project RAND made available to Harvard 
University a grant for further theoretical and empirical re- 
search on socio-psychological measurement. While only partial 
findings of the RAND study are available as this volume goes to 
the printer, the experience on this study has been of inestimable 
value in broadening the perspective of several of the authors of the 
present volume and leading them to make late revisions in their 
manuscripts—in some cases, revisions which are quite fundamental 
in character. Other contributions to this volume also draw on 
postwar research. John A. Clausen’s Chapters 15 and 16 on the 
prediction of what veterans would do after discharge, for example, 
depend in part upon data collected by the Veterans Administration 
as a follow-up of the same individuals studied by the Research 
Branch. 

Special acknowledgment is due to Frederick Mosteller, who con- 
tributed wise mathematical criticism to several of the chapters. 
Preparation of the final manuscript was the responsibility of Stuart 
Cleveland. The major proofreading task was performed by 
Blanchard Lyon, who also prepared the index. 

While this is a methodological volume, the content material used 
for illustrative purposes in some of the chapters has an intrinsic sub- 
stantive interest. For example, in Chapters 13 and 14, by Shirley 
A. Star, tables are presented on the scores made on a psychoneurotic 
inventory by all men in the United States inducted during a single 
month. Data are given separately for each induction station. 
These findings conceivably may have considerable impact on psy- 
chiatry, since they provide for the first time a bench mark against 
which to interpret the almost fantastic variations from one induction 
station to another throughout the nation in psychiatrists’ diagnoses. 

Most of the chapters can be read by the nonmathematician. But 
some chapters demand mathematical literacy, and all are written 
for the serious student. 

We expect some of the topics to be rather highly controversial. 
Several chapters represent a challenge to conventional thinking in 
this field. But they are not conceived by their authors as defini- 
tive. It is the hope that the study of these pages will inspire a 
younger generation of social scientists to do better the tasks here 
undertaken. 

SAMUEL A. STOUFFER 
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MEASUREMENT AND PREDICTION 


CHAPTER 1 


AN OVERVIEW OF THE CONTRIBUTIONS 
TO SCALING AND SCALE THEORY 


Introduction 


ITAL to the development of social psychology and sociology as 
V scientific disciplines is the definition and classification of the 
objects of study. 

This requires rigorous yet economical methods for handling data 
which are initially qualitative, not quantitative. The objective of 
much of the Research Branch methodological endeavors, and of the 
first eleven chapters of this volume, is to deal with theoretical models 
of ordered structures or scales and with technical procedures for testing 
the applicability of a particular model to a particular set of qualitative 
data 


In the course of this work a number of general theories have been 
developed which go well beyond present computational possibilities 
but which may have important implications for the future. 

Some of these results have already been published. Others have 
been presented as papers before professional societies. In these 
chapters the work is brought together in one place, in a form which, 
it is hoped, will be maximally useful both to the researcher who 
wants to make practical use of the new tools and to the scholar who 
seeks to sharpen his knowledge of the shortcomings of these tools in 
order to develop better ones. 

It is to be expected that considerable sections of these chapters 
will be controversial. The critical papers which have already ap- 


1 By Samuel A. Stouffer. While the author has had the benefit of critical readings 
of drafts of this chapter by the authors of Chapters 2 to 11, namely, Louis Guttman, 
Edward A. Suchman, and Paul F. Lazarsfeld, as well as by other experts in the field, 
he must assume responsibility for the summary statements in the present chapter. He 
has sought as faithfully as possible to represent the points of view of these authors and 
to reconcile divergencies of opinion, though not always succeeding completely, perhape, 
to each author's satisfaction. 
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peared, based on fragmentary advance publications, are testimony 
to the vitality of the problems with which these chapters deal.? 

Origmal and challenging as some of the new concepts here intro- 
duced may be, they represent only one small body of effort in a task 
whose magnitude is likely to absorb the energies of some of our best 
minds in psychology and sociology for many years to come. Itisa 
task which has elicited effort from able scholars long before the war. 

The thinking in these chapters owes a special debt to the prewar 
work of L. L. Thurstone of the University of Chicago, whose adapta- 
tion of psychophysical methods to attitude measurement was one of 
the earliest achievements in combining systematic theory and com- 
putational techniques in this area and whose subsequent leadership 
in the field of factor analysis built upon the work of Spearman and 
opened new vistas in psychological measurement generally. There 
are many other scholars whose research added to our knowledge. 
No systematic attempt will be made here to summarize this litera- 
ture. It is assumed that the reader has some familiarity with it 
and, in any case, a number of summaries are available.* 

Most of that work represented an effort to apply quantitative 
methods to qualitative data. While the reader will soon discover 
that the present volume involves some quite diverse approaches 
based on diverse premises, the one thing which these chapters have 
in common, differentiating them from the prewar work, is their effort 
to treat qualitative data as qualitative, not quantitative. 

Perhaps the most drastic departure from earlier approaches is 
represented in the initial thinking of Louis Guttman. In 1940, just 
before the war, Guttman contributed a series of studies on the logic 
of measurement and prediction to a monograph of the Social Science 
Research Council. This work contained the basic principle of ideas 

* For a discerning summary of this literature, see Jane Loevinger, ‘‘The Technic of 
Homogeneous Tests Compared with Some Aspects of ‘Scale Analysis’ and Factor 
Analysis,” Psychological Bulletin, Vol. 45, No. 6 (November 1948), pp. 507-529. 

* Thurstone’s approaches to attitude measurement based on psychophysical methods 
are summed up in a recent paper by him published as Chapter 5 in T. G. Andrews 
(editor), Methods in Psychology (John Wiley & Sons, Inc., New York, 1948). On his 
factor analysis, see L. L. Thurstone, Multiple Factor Analysis (University of Chicago 
Press, Chicago, 1947). The best prewar elementary textbook on measurement in psy- 
chology is probably J. P. Guilford, Psychometric Methods (McGraw-Hill Book Com- 
pany, New York, 1936). David Krech and Richard 8. Crutchfield, Theory and Prob- 
lems of Social Psychology (McGraw-Hill Book Company, New York, 1948) has a 
relatively up-to-date bibliography in Chapter 7, ‘‘The Measurement of Beliefs and 
Attitudes.” One of the most complete bibliographies of the literature on attitudes in 
general is that in Muzafer Sherif and Hadley Cantril, The Psychology of Ego-Involve- 
ments (John Wiley & Sons, Inc., New York, 1947). 

* Louis Guttman, “The Quantification of a Class of Attributes: A Theory and Method 


for Scale Construction,” in P. Horst et al., The Prediction of Personal Adjustment (Social 
Science Research Council. New York, 1941), pp. 319-348, 
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which he was to apply in the Research Branch a year later and which 
were to be greatly expanded theoretically and adapted for quick 
practical use. 

Guttman offered a model which dispenses with the concept of a 
latent or underlying continuum to which the response to a particular 
item is to be related. He considered an attitude area “‘scalable”’ if 
responses to a set of items in that area arranged themselves in cer- 
tain specified ways. In particular, it must be possible to order the 
items such that, ideally, persons who answer a given question favorably 
all have higher ranks than persons who answer the same question un- 
favorably. From a respondent’s rank or scale score we know ex- 
actly which items he endorsed. Thus we can say that the response 
to any item provides a definition of the respondent’s attitude. 

Guttman and his associates in the Research Branch developed 
simple and practical techniques for testing hypotheses as to whether 
attitude areas were scalable by this definition. Not all attitude 
areas satisfied the criteria of goodness of fit of the model, but it was 
possible to find many areas which seemed to be scalable and several 
hundred such scales were worked out during Research Branch ex- 
perience. Also important, theoretically, is the fact that the ranking 
of people by the Guttman model appears to represent only one of a 
set of principal components. In fact, if there are m scale types or 
rank groups, in the case of perfect scalability, there are m principal 
components which Guttman found to have a definite law of for- 
mation. The first component is a monotonic increasing function of 
the ranks—a straight line in the special case where the frequencies 
are the same for each rank group. The second component will al- 
ways be a U-shaped or J-shaped curve with one bend in it. This 
has been interpreted by Guttman as a measure of intensity, leading, 
in the ideal case, to the determination of an objective zero point, at 
the point where the curve of intensity is a minimum. 

The perfect Guttman scale is not likely to be found in practice, 
but satisfactory approximations to it are not so rare as some critics 
of the early papers describing it have implied. In analyzing data 
which failed to fit the model, Guttman has distinguished between 
two classes of misfits. One, which he calls the quasi scale, has the 
property that the errors are at random. He proves that the corre- 
lation of the quasi scale with an outside criterion is the same as the 
multiple correlation between responses to the individual items form- 
ing that scale and the outside criterion. This important result leads 
to great operational economies and justifies the use of sets of items 
from an area not scalable in his strictest sense. If the condition of 
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random errors is not satisfied we are confronted with what are called 
nonscale types. These nonscale types set problems for further anal- 
ysis, by indicating the presence of more than one variable. Inten- 
sive study in any given situation may lead to the dissection of a 
supposed attitude area into two or more subareas each of which may 
turn out to be scalable, in Guttman’s sense, when analyzed sepa- 
rately. 

Further research is needed on the quasi scale viewed simply as a 
system of manifest responses, without necessarily postulating an 
explicit underlying or latent continuum. 

In the last years of the war, Paul F. Lazarsfeld, while a consultant 
in the Research Branch, became especially concerned with these 
quasi scales. For several years he had been interested in the classi- 
fication of attributes, and especially in the properties of partial four- 
fold tables, on which Yule had written many years before with much 
insight in the opening chapters of his Introduction to the Theory of 
Statistics. As a result of his studies, Lazarsfeld proposed a fresh 
attack on the fundamental conceptualization of the scaling prob- 
lem. 

What Lazarsfeld proposed was a return to the older concept of a 
latent attitude continuum. Guttman, before the war, had studied, 
perhaps more thoroughly than any other person, the possibilities of 
this approach but had turned for the time being in a new and dif- 
ferent direction because he saw clearly that conventional methods 
of quantitative factor analysis, stemming from Spearman, were in- 
appropriate for handling qualitative data. Lazarsfeld’s primary 
achievement was to bring forth a new model for directly factoring 
qualitative data. Therefore, it would now seem to be practical to 
test quite rigorously the following basic hypothesis: There exists a 
set of latent classes, such that the manifest relationship between any two 
or more items on a test can be accounted for by the existence of these 
basic classes and by these alone. 

This implies that any attitude item has two aspects—one which is 
associated with the latent classes and one which is specific to the 
item. The specific aspect of any item is assumed to be independent 
of the latent classes and also independent of the specific aspect of 
any other item. 

The analogy with the Spearman-Thurstone approach in quanti- 
tative factor analysis and indebtedness to it is obvious. The con- 
trast with Guttman’s approach also should be clear. Guttman’s 
model deals only with the manifest relationship among attitude 
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items and defines an attitude directly as the observed responses to 
these items. Lazarsfeld defines an attitude as an inference as to 
latent classes, tested by fitting to the manifest data an appropriate 
latent structure model. 

Lazarsfeld’s approach is quite general as to the number of latent 
classes. Moreover, these classes may or may not be ordered. In 
the very important special case where they are ordered along a 
single dimension, Lazarsfeld has achieved further generalizations of 
importance. He conceives of the latent classes as segments of a 
continuum z. Over this latent attitude continuum, the probability, 
p(x), of a “favorable” response to a given specific attitude item, may 
be described as a continuous function of x. Such a “trace line’’ can 
take any shape, but the study of relatively simple types of ‘‘trace 
lines,” such as the straight line and polynomials of second and third 
degree, is proving to be of much interest. 

It turns out, further, that Guttman’s quasi scale can be derived 
analytically as a special case of latent structure analysis, Guttman’s 
perfect scale becoming a limiting case of the quasi scale. 

In comparing the approaches of Guttman and Lazarsfeld to the 
problem of testing whether or not ordered structures apply in a 
given attitude region, it is important to avoid metaphysical faith in 
a particular model. For example, if one can dispense with the con- 
cept of a latent structure and operate effectively directly with mani- 
fest data, as Guttman does, there could be advantages of logical 
parsimony in the Guttman approach. The fact that a perfect 
Guttman scale is not obtained except approximately is not in itself 
a denial of the value of that model. The crucial question is whether 
it has ideal properties from which one may make rich and varied 
logical deductions—as seems to be possible in connection with the 
theory of. principal components—and whether it leads to rapid and 
enlightening empirical tests of hypotheses as to the structure of con- 
crete attitude areas. Similarly, it will be noted that Lazarsfeld’s 
postulate of a continuous trace line p(x) cannot itself be empirically 
demonstrated. However, as will be shown in this volume with 
numerical examples from Research Branch data, the hypothetical 
latent continuum z can be cut into segments and discontinuous 
values of p(x) determined empirically for each segment. The latent 
continuum is a hypothetical construct and its usefulness lies in its 
range of logical productivity and its effectiveness in guiding the 
analysis of actual data. 

In this volume, we have not been content merely to present the 
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mathematics of these new approaches to the problem of testing 
hypotheses of ordered structures. We have sought also to provide 
the details of computing techniques by which various hypotheses 
can be tested empirically. This volume contains many concrete 
examples from Research Branch data. The development of com- 
puting equations and of short-cut methods for clerical operations is 
second only to the formulation of the basic theory in its importance 
for progress in this field. 

The initial objective in applying one or another of the various 
proposed models is to discover whether, with a given set of items, 
an area is scalable. A second objective, if a scalable dimension is 
found, is to order respondents along that dimension in a practical 
operation. 

There are many methods in current use for ordering respondents. 
The term “item analysis’ is commonly used to describe most such 
operations. Usually, a provisional total score is computed by add- 
ing up a person’s responses to items initially weighted, and decision 
as to whether or not to retain a particular item in the “‘scale’’ turns 
on how highly the responses to it are correlated with the provisional 
total score. The objection to this procedure is that, at the end, one 
is still likely to be retaining several dimensions in a single so-called 
“scale.” Such a ‘‘scale’’ may appear to order respondents, but it 
is not backed by a theoretical model which provides a serviceable 
criterion that these respondents have been ranked along one dimen- 
sion only. 

The Research Branch, like most practical research agencies, faced 
a situation in which it was usually not economical to use more than 
a few items in a given attitude scale. Or, if the presence of several 
dimensions was suspected, it was not possible to use large batteries 
of tests to isolate these dimensions. Moreover, it was adequate 
to order people in a few rank groups, provided there was a single 
demonstrable dimension. It was not necessary, nor is it likely to 
be necessary in much of the social and psychological research of the 
near future, to have a very large number of rank groups. 

One of the fortunate results of the developments reported in the 
following chapters is the evidence that—if an attitude area is scal- 
able—a relatively small number of items is sufficient for classifying 
people for many practical purposes. Thus, while a larger number 
of items may be required in pretests in order to study the structure 
of each attitude, only a small number of items need be used in a final 
study for those attitudes which are scalable. 
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It may be objected that the price one pays for unidimensionality 
is loss of generality. That can be true. But, if a given region of 
attitudes is not unidimensional it does no good to close our eyes to 
that fact and pretend that it is. Instead, if the region yields sev- 
eral quite specific scales, each clearly unidimensional, we are then 
in a position to proceed more economically with an explicitly multi- 
ple dimensional study, testing theories of the interrelationships of 
the scales used. This study may require us to try eventually to re- 
duce the number of these initial dimensions by some form of quanti- 
tative factor analysis. But the first step is analysis of the structure, 
in an effort to isolate clean unidimensional specific scales. 

In order to give the reader further orientation as to the viewpoints 
and findings of the chapters to follow, we shall now review in some- 
what more detail, without mathematics, (1) the Guttman approach 
to the testing of hypotheses as to ordered structures and (2) the 
latent structure approach as developed by Lazarsfeld. These will 
be viewed in terms of the objective of finding among respondents a 
rank order along a particular attitude continuum, if such exists. 
Finally, (3) we shall take a preliminary look at the problem of utiliz- 
ing indices of intensity of attitude to locate a “zero point” or “re- 
gion of indifference.” 


1. The Scalogram Theory of Establishing Rank Order 


The approach which was developed in the Research Branch under 
the guiding hand of Louis Guttman has been named scalogram anal- 
ysis. Several attitude areas were analyzed during the war to see 
whether items from these areas could be accepted as scalable. Some 
of them have been used in earlier volumes of this series. As experi- 
ence proceeded, some of the earlier criteria were revised, and many 
of the areas once deemed sufficiently scalable do not satisfy the more 
rigorous criteria subsequently imposed. But the general logical out- 
line, which is quite a simple one, has stood the test of a wide variety 
of applications. 

The scalogram hypothesis is that the items have an order such 
that, ideally, persons who answer a given question favorably all have , 
higher ranks on the scale than persons who answer the same question 
unfavorably. From a respondent’s rank or scale score we know ex- 
actly which items he endorsed. Thus, ideally, scales derived from 
scalogram analysis have the property that the responses to the indi- 
vidual items are reproducible from the scale scores. 

In practice, this ideal is not perfectly attained. In Chapters 3 « 
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and 4 standards are set forth for accepting data as constituting such 
ascale. There have been some misunderstandings in the profession 
as to these standards, and as to the reasoning as well as the em- 
pirical experience on which they are based. 

The standards have been and doubtless will be criticized from two 
quite opposite points of view: (a) that they are too lenient, permit- 
ting areas to be accepted as scalable when they are not, and (b) that 
they are too stringent, limiting too severely the number of areas 
which can be accepted as scalable. There is an arbitrary element 
of judgment involved in arriving at a standard, just as there is in 
arriving at a standard such as the 5 per cent level for a test of sig- 
nificance in sampling theory. It is quite possible that some of the 
standards here proposed may be changed with increasing experience. 
Research Branch experience has shown that when the proposed cri- 
teria are adopted, many areas have been found to be scalable; rank 
scores based on even a small number of items from such an area 
necessarily have high test-retest reliability (according to a theorem 
of scalogram analysis) and have been found in practice to correlate 
satisfactorily with external variables. 

The items used in a scalogram analysis must have a special cumu- 
lative property. The general idea may be illustrated by a hypo- 
thetical scale of stature comprised of responses to three items: 


1. Are you over 6 feet tall? _____ Yes No 
2. Are you over 5 feet 6 inches tall? __—. Yes No 
3. Are you over 5 feet tall? a= =. Yes No 


If a person checks item 1 “Yes,” he must, unless he is careless, 
also check items 2 and 3 “Yes.’’ If he checks item 1 “No” and 
item 2 “Yes” he must also check item 3 ‘‘Yes.”” Hence, if we give 
a score of 2 to a man who has endorsed two items we know exactly 
which two items he endorsed. He could not say ‘‘Yes” to item 1, 
“No” to item 2, and ‘Yes’ to item 3. The four admissible response 
patterns to the three items are shown below: 


Rank order of Says yes to Says no to 
respondents Score ttem ttem 
1 2 8 1 2 8 
1 3 xX X X 
2 2 xX xX x 
3 1 x x XxX 
4 0 xX xX X 
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This simple diagram is called a scalogram—hence the name scalo- 
gram analysis for the procedure. 

It is essential to note that the items are cumulative. A man who 
answers ‘‘Yes’’ to item 1 has the stature of a man who answers “‘Yes’’ 
to item 2 and “No” to item 1, plus additional stature. Thus the fol- 
lowing items would not yield a scale in a scalogram analysis. They 
are not cumulative. 


1. Are you over six feet tall? =>. Yeu No 
2. Are you between 5 feet 6” and 6 feet tall? _.___._ Yes No 
3. Are you between 5 feet and 5 feet 6” tall? ____ Yes No 
4. Are you under 5 feet tall? —__— Yes No 


Here it is possible to make only one affirmative response and we 
must use additional knowledge of the fact that a man who says 
“Yes” to item 1 is taller than a man who says “‘Yes’”’ to item 2 if 
we are to order the items. The scale picture would look as follows: 


Rank order of Says yes to Says no to 
respondents item item 
1 2 $s 4 1 2 $s 4&4 
1 x xX X X 
2 x x xX X 
3 xX xX X x 
4 ».« xX xX X 


In this example, the order of the items is implicit in the structure 
of the questions. Where this is not the case, a device sometimes 
used is to have a set of judges rank the items in advance. The con- 
sistency with which the judges agree in their initial ranking can be 
evaluated. But since only one answer is permissible, there is no 
test of internal consistency of the subsequent respondents. Such a 
test. could be devised by, for example, asking each respondent to 
check “Yes” to the two items closest to his position. This would 
reduce the number of rank groups, but would provide objective evi- 
dence of order such that initial judges would no longer be needed, 
although they might still be useful. The scale picture would be: 


Rank order of Says yes to Says no to 
respondents tem ttem 
1 5 8 4 1 2 8 4 
1 x Xx xX X 
2 xX X x x 
3 xX Xx xX xX 
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It would seem to be quite possible to use as a model this type of 
“interval” or “position” scale. In fact, Frederick Mosteller has 
developed the theory of such a scale, in connection with a study of 
scaling sponsored by the RAND Corporation at the Harvard Lab- 
oratory of Social Relations. It will be noted that the cumulative 
type of item which fits the scalogram model will not fit this model, 
and vice versa. 

One can handle the stature items 1, 2, 3, and 4 above in still other 
ways. For example, the respondent might be asked to rank the 
four stature intervals in terms of their closeness to his height. If 
he is 5 feet, 2 inches tall, the order would be interval 3, rank 1; 
interval 4, rank 2; interval 2, rank 3; interval 1, rank 4. For all 
respondents, we would get the following scale picture (the man 
whose stature is 5 feet, 2 inches, is in rank group 5, for example): 


Rank order of Rank assigned to 
respondents item number 

1 2 8 & 

1 1 2 3 #4 

2 2 1 3 64 

3 3 1 2 4 

4 4 2 1 3 

5 4 38 1 2 

6 4 3 2 1 


The same result could be derived from a set of n(n — 1)/2 paired 
comparisons, which would have the additional advantage of pro- 
viding a check on the internal consistency of an individual as well as 
the internal consistency of a group of individuals. Questions would 
be of the type: 


Which of these is closer to your stature? (Check one) 
Over 6 feet 
Between 5 feet 6” and 6 feet 


All of these examples illustrate how a model can be set up without 
explicitly postulating a latent continuum. The ordering of manifest 
responses if perfectly consistent will reveal the appropriate structure.® 


5 Thurstone’s law of comparative judgment postulates an underlying continuum and 
seeks to derive, usually with the aid of the method of paired comparisons, not only a 
rank order but a metric. In psychophysical measurement, such as the classical example 
of lifted weights, the assumption is made that each individual respondent has the same 
position and that variations in judgment either within the same individual or between 
8 group of individuals represent discriminal error. This discriminal error provides a 
unit of measurement. But in the present example, each individual can be perfectly 
consistent in all his responses, yet because each individual can have a different position 
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The scalogram model, requiring cumulative items, has at the present 
time been studied far more thoroughly than these others. There- 
fore, our attention will now be focused on it. We must keep clearly 
in mind the fact that the cumulative character of the items satis- 
fying a scalogram analysis puts a restriction upon the type of data 
which can be used with this model. Yet the restriction is not, in 
practice, as severe as may at first appear. 

It is often possible to find items which have an intrinsic cumu- 
lative character. The prototype is perhaps the social distance 
scale, with such items as the following: 


1. Would you want a relative of yours to marry a Negro? 
2. Would you invite a Negro to dinner at your home? 
3. Would you allow a Negro to vote? 


An illustration from Research Branch data of this type of intrinsic 
scale may be taken from a study of riflemen overseas who had re- 
cently experienced combat. The data are presented more in detail 
in Chapter 5. It was desired to see if there was a scale order in the 
way in which respondents reported experiencing fear. The follow- 
ing question was asked: 


Soldiers who have been under fire report different physical reactions to 
the dangers of battle. Some of these are given in the following list. How 
often have you had these reactions when you were under fire? Check one 
answer after each of the reactions listed to show how often you had the re- 
action. Please do it carefully. 

[There followed 10 items, each with a four-step check list. For ex- 
ample: 

Shaking or trembling all over 
Often 


A scale picture (shown in Chapter 5, Scalogram 7) indicated that 
nine of these items formed a very satisfactory scale when ordered as 
follows: 


1. Urinating in pants 

2. Losing control of the bowels 
(i.e., different stature), the group variability will not represent discriminal error. De- 
tailed analysis of this phenomenon has been made by Clyde H. Coombs, in connection 
with his work on the RAND study at the Harvard Laboratory of Social Relations. 
Among other things, he has shown that a rank order table like that above, or a paired 
comparisons table, does reveal under certain conditions whether some rank intervals 
have greater magnitude than others. 
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. Vomiting 

. Feeling of weakness or feeling faint 

. Feeling of stiffness 

. Feeling sick at the stomach 

. Shaking or trembling all over 


. Sinking feeling of the stomach 
. Violent pounding of the heart 


© CO SI O2 Or OO 


One item—*‘cold sweat’’—did not fit into the scale; that is, some 
people who experienced less frequent fear symptoms than cold sweat 
also experienced cold sweat, but this was also true of some people 
who experienced more frequent symptoms than cold sweat. Hence 
“cold sweat” was shown to involve a factor or factors additional to 
the scale variable. 

The fact that the nine items satisfied the scalogram criteria means 
that if a man did not report vomiting, for example, he also did not 
report urinating in his pants or losing control of his bowels. If he 
did report vomiting, he also reported all of the other experiences 
(4 through 9 on the list) which were generally more frequent than 
vomiting. In this case he would have a score of 7 and if the scale 
were perfect we would know exactly which seven experiences he re- 
ported. Actually, of course, the scale was not quite perfect. By 
» an easy procedure, described in Chapter 4, it was possible to com- 
pute a coefficient of reproducibility, which was .92. This means that 
if we knew any scale score, such as 7, and if we guessed the exact 
items which respondents with this scale score endorsed and did not 
endorse, we would guess 92 out of 100 of the items correctly and 8 
out of 100 incorrectly. The scale picture also met other criteria 
described in Chapters 3 and 4, with respect to individual item repro- 
ducibility, wide range of margmals, and randomness of error. 

It may be that only a limited range of psychological and social 
phenomena have the intrinsic cumulative characteristic described 
above. An area which should eventually be quite rich in data of 
this type should be that of level of difficulty. For example, three 
problems in calculus: 


1. Integrate = ayly — 0) 


dy 
2. Integrate ant 


3. Differentiate y = 2° 


It is unlikely that a person who can do problem 1 would fail on prob- 
lems 2 and 3, and it is also unlikely that a person who failed on prob- 
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lem 2 could do problem 1. In other words, if A has a higher score 
than B, A can do all that B can do, plus something more. 

The possibility should be faced, however, that large areas of psy- 
chological or social behavior may not yield items which approximate 
the scale pattern required by scalogram analysis. For example, a 
short catalogue of psychosomatic symptoms which the Research 
Branch used in constructing a psychiatric screening test simply did 
not order itself in a manner satisfying the rigorous criteria set up. 
That is, a man who said he was bothered with shortness of breath 
might or might not complain of spells of dizziness, and vice versa. 
As we shall see later, it is still possible to construct a scale—cal'ed 
a quasi scale—which lacks the property of reproducibility but which 
does have the valuable property of yielding the same correlation with 
an outside criterion as does the multiple correlation of the individual 
items with that criterion. 

It is frequently possible, however, to order items cumulatively 
(and hence to order respondents) by constructing items with multi- 
ple check lists and choosing cutting points by combining categories 
such that the error of reproducibility is minimized. 

Consider, for example, three items which happen to have uniform 
format. (Such uniformity is not at all essential—the number of 
response categories can vary arbitrarily from item to item and the 
wording of the categories can be varied): 


1. How many of your officers take a personal interest in their men? 


All of them 
2____ Most of them 
3 _____ About half of them 
4 Some of them 
5 Few or none of them 
2. How many of your officers will go through anything they ask their men 
to go through? 
1 All of them 
2 Most of them 
3 _____. About half of them 
4 Some of them 
5 Few or none of them 


3. How many of your officers are the kind you would want to serve under 


in combat? 
1 ____ All of them 
2 _____. Most of them 
3 —_____ About half of them 
4 Some of them 
5 Few or none of them 
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If each item is dichotomized by taking the two top categories as 
“favorable” responses, the three items will not ordinarily form a 
scale by scalogram analysis. In a sense, in spite of manifest dif- 
ference in content, these three particular items are more or less 
synonymous. Hence, the frequencies probably will not be cumu- 
lative. However, if for item 1 we take response category 1 as 
“positive,” for item 2 response categories 1 and 2, and for item 3 
response categories 1, 2, and 3, we perhaps can build a cumulative 
set of items. This is, of course, just one of various ways in which a 
cumulative set of responses might be built from these items. The 
procedure for selecting cutting points which will maximize scalability 
has been reduced to a simple routine and is described in detail in 
Chapter 4. It is not arbitrary, but is based upon a study of the 
simultaneous distribution of all the original responses. From a 
practical standpoint, the operational procedures developed in the 
Research Branch for swiftly evaluating a large number of questions 
simultaneously may rank as one of the major contributions of the 
Branch to social science technique. Work which would have re- 
quired hundreds of hours of elaborate machine analysis can now be 
done in a few hours by one semi-skilled clerk. The reader will find 
a full description of the operations in Chapters 4 and 5, with a dia- 
gram for the construction of a scalogram board, which remarkably 
facilitates the scaling process. 

There can be no doubt that the empirical choice of eine points 
on more or less synonymous items in order to achieve a set of cumu- 
lative responses is rather crude. It needs to be studied and criti- 
cized. But it does supply an ordering of respondents with a high 
degree of reliability and the ultimate test will be its utility in com- 
parison with other techniques. 

Relatively few of the scales described in this volume can be strictly 
described as based on near synonymous items of the kind illustrated 
above. In Chapter 4, for example, the detailed procedures used in 
working out a scale of general attitudes toward the Army are de- 
scribed. Twelve items comprise this scale, and some of the items 


* Two people can check different response categories and still have the same attitude, 
because they differ in verbal habits. For example, one person may have a general 
tendency toward extreme statements and say “‘all of them’ to a given question, whereas 
another person may have the same attitude toward officers but express it on the same 
question by answering “most of them.” If these two responses are treated as separate 
categories they could exhibit substantial error of reproducibility, which tends to vanish 
when the categories are combined (that is, treated as though they were the same re- 
sponse). The scalogram analysis shows how best to combine categories in order to 
reduce errors of reproducibility. 
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rather closely resemble each other in format. But they are cer- 
tainly not synonymous. Consider four of them: 


In general, do you think you yourself have gotten a square deal in the Army? 
Yes, in most ways I have 

In some ways yes, in other ways, no 

No, on the whole I haven’t gotten a square deal 


In general, how well do you think the Army is run? 
It is run very well 

It is run pretty well 

It is not run so well 

It is run very poorly 

Undecided 


Do you think the Army has tried its best to look out for the welfare of 
enlisted men? 

__— Yes, it has tried its best 

——__—— It has tried some, but not enough 

——_-—— It has hardly tried at all 


Do you think when you are discharged you will go back to civilian life 
with a favorable or unfavorable attitude toward the Army? 

Very favorable 

——_— Fairly favorable 

—___—. About 50-50 

—____. Fairly unfavorable 

—_— Very unfavorable 


These items, along with eight other items also of a general charac- 
ter but varying in format, were found to cumulate, after the choice 
of appropriate cutting points, such that they satisfied the scalogram 
criteria. The operations, step by step, are described in Chapter 4. 

The greater the variety of questions, of course, the wider the 
generalization which can be made about the coverage of a particular 
scale, assuming that criteria of scalability are satisfied. 

It is recommended in Chapters 3 and 4 that a relatively large 
number of items, preferably as many as ten or twelve, be used in 
the initial testing of the hypothesis of scalability, and that, if pos- 
sible, some of these items be trichotomized rather than dichoto- 
mized. This will ordinarily make it quite difficult to achieve high 
reproducibility, but protects against spurious results which might 
be obtained by chance with only three or four dichotomous items. 

Critics of the use of only three or four items seem to have over- 
looked an important distinction, namely, that while it is desirable 
to use as many as ten or a dozen items in initial testing of scal- 
ability, a smaller number of items can be safely selected from the 
scalable list for practical research purposes. If a dozen items re- 
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veal a scale pattern, then a smaller number selected from these 
dozen will show the same pattern (though with fewer ranks). Be- 
cause they belong to the same scale, according to scalogram criteria, 
a smaller subset of items finally selected for use—possibly com- 
prising only three or four—will of necessity correlate very highly 
with other subsets of items in the same scale, as is proved in Chap- 
ter 8 and illustrated with numerical examples. Items comprising 
such a subset will have higher reliability than similar items from 
most other types of scales, since each response is a definition of the 
respondent’s position on a single continuum and is minimally cor- 
rupted with other affective material which correlates with some 
items and not with others. 

In fact, one of the important consequences of finding an attitude 
to be scalable is that one is then justified in selecting three or four 
items which can be used to order respondents in a limited number 
of ranks. It may be possible eventually to use a pretest for select- 
ing a single item for practical use—such as is the staple of much con- 
ventional market research or public opinion polling. But now the 
item is selected with full knowledge as to its place in the attitude 
structure. Intensity analysis in particular, as discussed in the last 
section of this chapter, may help determine objectively which item 
is unbiased with respect to the attitude as a whole, in the sense of 
dividing people into two groups, those favorable and those unfavor- 
able on a given issue. 

In attitude research of the future, an important desideratum will 
be to obtain simultaneous measures on a large number of continua 
from a given respondent. Scalable attitudes lend themselves par- 
ticularly well to this in practice. Why? Because each attitude can 
be represented by only a few items, so that a large complex of atti- 
tudes can be observed in a single study. Thus multiple factor anal- 
ysis might be employed to construct a typology of these continua, 
and we may well be on the road to an analysis of socio-psychological 
problems comparable to the road which has led to progress in the 
study of human abilities. 

When the hypothesis of the scale structure is found not to be borne 
out by the observed data, the attitude items may still have an or- 
dered structure, as has been sad, namely, that of the quasi scale. 
The conditions for existence of a quasi scale need more study than 
is provided in this volume. Here we are focusing attention on the 
Guttman procedure. There is, of course, the possibility that en- 
tirely different models might have fitted the data. 
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The examination of the properties of quasi scales in particular 
and of scalogram analysis in general, which was begun by Paul F. 
Lazarsfeld while a consultant in the Research Branch, led to the 
development of a fresh approach to the problem of ordering re- 
spondents, which will be described in the following section. 


2. The Latent Structure Theory 


. The conceptual models which the latent structure approach pro- 
vides are described in algebraic and numerical detail in Chapters 
10 and 11. In the present context we shall merely sketch the out- 
lines of the ideas, indicate something of their possible implications 

_for the future, and provide a numerical illustration from Research 
Branch data. While the development of the latent structure theory 
grew directly out of Lazarsfeld’s work as consultant in the Research 
Branch, and has benefited from consultation with members of the 
Branch, its detailed working out did not come until after the end 
of the war. 

The latent structure approach is a generalization of Spearman- 
Thurstone factor analysis. The basic postulate is that there exists 
a set of latent classes, such that the manifest relationship between any 
two or more items on a questionnaire can be accounted for by the exist- 
ence of these latent classes and by these alone. This implies that any 
item has two components—one which is associated with latent 
classes and one which is specific to the item. The specific com- 
ponent of any item is assumed to be independent of the latent 
classes and also independent of the specific component of any other 
item. 

The theory is quite general as to the number of latent classes. 

In order to familiarize the reader with the fundamental idea, let us 

first consider the special case where the number of latent classes is 
two. 

This special case, which is called latent dichotomy analysis, to dis- 
tinguish it from the more general concept of latent structure analy- 
sis, is quite simple to understand and, as will be shown, has some 
interesting properties. 

In applying the latent dichotomy model we seek to partition the 
sample into two classes of respondents—those who possess the 
latent character and those who do not. The latent dichotomy is, 
of course, an inference. Whether or not a given set of items can be 
fitted by a latent dichotomy has to be determined from the data, 
which ideally must satisfy a very rigorous criterion. 
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This criterion requires, as already indicated, that all of the rela- 
tionship between any two questionnaire items be accounted for by 
the hypothetical latent dichotomy. Suppose that all respondents 
had labels which permitted us to sort them into two classes—those 
possessing the latent character and those not possessing it. Our 
criterion requires that, among those possessing the latent character, 
there be no association or correlation between the responses to any 
two individual items. Similarly, among those not possessing the 
latent character, there be no association or correlation between the 
response to any two individual items. We shall make this concrete 
with a numerical example presently. 

Actually, as we shall see, the ideal criterion conditions for a latent 
dichotomy are not likely to be realized. But they may be approz- 
imately realized—at least with a close enough approximation to 
satisfy us that the model is appropriate for a given set of items. If 
80, we can then use the information obtained to order the respond- 
ents. 

For each response pattern—for example, + — + —, a pattern 
in which a person is favorable on item 1, unfavorable on item 2, 
favorable on item 3, and unfavorable on item 4—we can estimate 
how many possess the latent character and how many donot We 
can therefore calculate an estimate of the proportion who possess the 
latent character. We can do this for all possible response patterns. 
We can then put the response patterns in rank order according to 
the rank order of the proportions estimated as possessing the latent, 
character. Since the individual respondents must fall into response 
patterns, we have ordered the individual respondents, except that 
all those with the same response pattern are of course tied in rank. 

It will be seen at once that the latent dichotomy concept hypothe- 
sizes a fundamentally different structure from that of the cumu- 
lative scale model. 

The scalogram model considers the response to each item as pro- 
viding a definition of the respondent’s attitude on the subject of the 
particular attitude scale. The latent dichotomy model does not 
require us to call a response to a particular item a definition of the 
respondent’s attitude. Instead, it requires that we conceive of his 
response as having two components—one, which is a manifestation 
of a latent character and the other, which is specific to the item. 
Thus two items endorsed by a person, such as, ‘The British are 
brave fighters” and ‘‘The British are unselfish allies” may not serve 
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as definitions of the same attitude, by scalogram analysis, but may 
prove to possess a common latent dichotomy, as well as residual 
content which is specific for each item, respectively. But note— 
the residual specific content of the two items must be unrelated. 
That is, among all men possessing the latent character, those who 
say the British are brave must be no more likely to say the British 
are unselfish allies than do those who do not say the British are 
brave. All the association between these two items must be at- 
tributable to the common latent character which they possess. 

Now let us look at some numerical data, from a Research Branch 
attitude survey, which will help the reader see concretely what goes 
on when the latent dichotomy model is approximately realized. 

We shall take the same four items on general attitudes toward the 
Army which were cited in the previous section of this chapter, using 
data from a sample of 1,000 noncoms, studied in October 1945. 
Responses are dichotomized, not as in scalogram procedure by 
choosing cutting points in such a way as to minimize scalogram 
error, but according to what seems, a priori, to be the manifest con- 
tent of the response. We have: 


1. Do you think the Army has tried its best to look out for the wel- 


fare of enlisted men? 
Proportion saying, ‘Yes, it has tried ita best” .254 
2. In general, do you think you yourself have gotten a square deal in 
the Army? 
Proportion saying, “Yes, in most ways I have” .300 


3. Do you think when you are discharged you will go back to civil- 
ian life with a favorable or unfavorable attitude toward the Army? 
Proportion saying, “Very favorable” or “Fairly favorable’  .374 


4. In general, how well do you think the Army is run? 
Proportion saying, “It is run very well” or “It is run pretty 


well” 641 


Now, after responses to each of these four items are dichotomized, 
there will be 24 = 16 possible response patterns, determined from 
cross tabulation. In Table 1, column 1, the number of cases with 
each of the 16 response patterns is shown. For example, there are 
75 cases with the response pattern + + + +, favorable on all four 
items. There are 55 cases with the response pattern + — + +, 
favorable on items 1, 3, and 4 and unfavorable on item 2, etc. 
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Sorprezxs wits Various Response PaTrerns RANKED ACCORDING TO 
Proportions Posssssinc Latent CHARACTER OF FAVORABLE 
ATTITUDE TOWARD THE ARMY 


ADJUSTED NUMBER OF CASES 


—_—_—————————-__ Proportion 
Response Observed Possessing Not possess- possessing 
pattern number latent ing latent latent 
Item of cases Total character character character Rank 
(3) 
1234 (1) @ = @ + O- © 
t+++t+ 76 72.2 71.9 0.3 996 1 
t++-—-+ 42 44.0 42.7 1.3 971 2 
+-—+t+ 55 56.8 54.2 2.6 953 3 
-+++ 69 72.8 68.4 4.4 940 4 
+++- 3 6.3 5.9 0.4 938 5 
+--+ 45 43.7 32.2 11.5 736 6 
-+-+ 60 59.7 40.6 19.1 680 7 
t+-- 10 5.2 3.5 1.7 674 8 
--++ 96 90.8 61.6 89.2 668 9 
-+- 8 8.0 4.5 3.5 561 10 
-++- 16 11.4 5.6 5.8 494 ll 
--—--—+ 199 201.0 30.6 170.4 162 12 
+--- 16 17.8 2.6 15.2 148 13 
-¢-- 25 28.4 3.3 25.1 117 14 
--¢+- 52 55.7 4.2 61.5 076 15 
---- 339 226.2 3.5 223.7 011 16 
Total 1,000 1,000.0 424.3 575.7 


The reader will recognize that only 5 of these 16 response patterns 
would constitute perfect scale types in a scalogram analysis free of 
error. These, shown in boldface type in Table 1, are: 


t+e++ 75 cases 
-~++4+ 69 cases 
-——-+t 96 cases 
-~--F+ 199 cases 
--<--- 229 cases 
Total 668 cases 


The other 332 cases are distributed among the 11 response pat- 
terns which would involve error by the scalogram model. Accord- 
ing to the latent dichotomy model, however, these 11 response pat- 
terns do not necessarily represent error.’ 

7It must be kept in mind that the four items as here used are dichotomized quite dif- 


ferently from the way they were dichotomized by the scalogram analysis in which they 
served as items belonging to a cumulative scale. 
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‘The problem is to determine whether or not the respondents can 
be partitioned into two classes, those possessing and those not pos- 
sessing the latent character. We will skip now the technical details 
of how that determination is made and concern ourselves only with 
its results. Since the latent dichotomy model ordinarily will not fit 
quite perfectly, one can compute for each response pattern an 
adjusted frequency which will differ somewhat from the observed 
frequency. The adjusted frequencies are of such a nature that the 
division into latent groups can be performed with precision. The 
adjusted frequencies are shown in column 2. A test of the adequacy 
of the latent dichotomy model is the agreement between the ob- 
served frequencies in column 1] and the adjusted frequencies in col- 
umn 2. In the present example, it will be seen that the agreement 
is rather close. For example, in response pattern + + + + the 
observed number is 75, the adjusted number 72.2, an error of 2.8. 
The sum of such errors, irrespective of sign, for the 16 response pat- 
terns is 43.6 out of a total of 1,000 cases, or less than 5 per cent.® 

Now let us focus our attention on the adjusted frequencies for the 
various response patterns. By technical operations whose descrip- 
tion we shall for the present postpone, we can partition the ad- 
justed frequencies into two components, those possessing and those 
not possessing the latent character. For example, take the response 
pattern — +++. The 72.8 adjusted cases are partitioned into 
two latent classes, 68.4 who possess the latent character and 4.4 
who do not. The numbers estimated as having the latent character 
are shown in column 3 of Table 1 and the numbers without it are 
shown in column 4. On any line the sum of the numbers in columns 
3 and 4 add to the number in column 2. 

Next, we can estimate the proportion in each response pattern 
who possess the latent character. This is shown in column 5. For 
response pattern — + ++, for example, this proportion is 
68.4 + 72.8 = .940. Of course, we cannot pick out which particular 
individual possesses or does not possess the latent character. 

With our estimate for each response pattern of the proportions 
possessing the latent character we can now estimate the rank of the 
response pattern as is shown in column 6. Our task is completed, 
for we now have the respondents ordered in 16 rank groups accord- 
ing to the proportion possessing the latent character. 

Now let us look again at Table 1 and see empirically what is 


® Appropriate tests of goodness of fit and acceptance standards are still under investi- 
gation. 
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meant by saying that there is no association between the items ex- 
cept that due to the latent dichotomy. Consider, for example, items 
1 and 2 only. For those possessing the latent character, let us pool 
the results for all response patterns in which items 1 and 2 are both 
positive. We have, from column 3 in Table 1: 


+444 71.9 
+4+4- 5.9 
+4+—4+ 42.7 
++-- 3.5 
Total 124.0 


Similarly, pooling the results for all response patterns in which 
item 1 is positive and item 2 is negative, we have: 


+-++ 542 


+-4+- 4.5 
+--+ 32.2 
+--- 2.6 

Total 93.5 


For all response patterns in which item 1 is negative and item 2 is 
positive we have 117.9 and for all response patterns in which both 
item 1 and item 2 are negative we have 88.9. Now let us form the 
following fourfold table: 


Item 1 
- + 


+ 117.9 124.0 
Item 2 [——$_—____| 
- 88.9 93.5 


Within the limits of error introduced by rounding, it will be seen 
that there is no association. In other words, the ratios of the num- 
bers in the first row to the numbers in the second row are nearly 
identical. We get 1.3262 and 1.3262, identical to the fifth signifi- 
cant figure. Thus, among respondents possessing the latent char- 
acter, there is no association between the specific components of 
items 1 and2. Analogously, we can find that the same holds among 
respondents not possessing the latent character and it holds for any 
pair of items, not merely items 1 and 2. 

It must be noted again that the absence of association between 
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the specific components of any two items is achieved by basing the 
calculations on an adjusted number of frequencies (column 2 in 
Table 1). It would no longer quite hold if the percentages in col- 
umn 5 were applied to the original observed frequencies in column 1. 
That is because the original data do not perfectly satisfy the criterion 
for the existence of a latent attribute. But we are making no serious 
error if we assume, say, that all the 75 cases in response pattern 
(+ + + +) have the same proportion possessing the latent at- 
tribute (.996) as the 72.2 estimated cases after adjustment. For, 
as we have seen, over the whole table less than 5 per cent of the 
1,000 men wou!'d be misclassified thereby. 

The reader should study Table 1 and the discussion of it carefully 
in order to grasp the basic logic of the latent structure theory as ap- 
plied in this simplest of models, namely that of a latent dichotomy. 
Some words of caution are now in order. In particular, the reader 
may ask, what sort of confidence can one place in column 5 in Table 
1—the proportions possessing the latent character among persons 
with a given manifest response pattern? It is necessary to issue a 
warning here that these proportions may be subject to several dif- 
feremt sources of instability or sampling variability; for example: 

(a) The kind of variability involved in test-retest, repeating the 
same four items on the same population of subjects. 

(b) The kind of variability involved in using the same four items 
on two different populations of subjects. 

(c) The kind of variability involved in substitution of a new 
“parallel form” of item for each item initially used. 

As the number of questions satisfying the model increases, the 
number of scale patterns increases and the proportions for a scale 
pattern converge toward zero or unity. The number of persons 
with scale patterns such that one cannot be pretty sure that they 
do or do not possess the latent character tends to become fewer and 
fewer. 

It follows from these considerations that the ordering of persons 
into 16 rank groups as in Table 1 is an empirical result which is sub- 
ject to several kinds of unreliability. It is possible that the 16 
rank orders thus derived would be considerably more stable than the 
rank orders derived from, say, four items which do not satisfy the 
latent structure criteria. However, a rigorous analytical job on 
this remains to be done. 

The crucial thing accomplished by the operations exhibited in 
Table 1 is that it provides evidence of the degree to which the mani- 
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fest items satisfy criteria of uwnidimensionality. The need for such 
criteria we must keep at the forefront of our thinking. This is ac- 
complished not perfectly in practice, but quite adequately with the 
latent dichotomy model in Table 1 or with more complicated models 
to be described presently. 

Even if the values, in Table 1, column 5, of the proportions for a 
given manifest scale pattern who possess the latent character should 
“hop around’? somewhat, under the conditions itemized above, it 
still seems useful to apply them in a given empirical investigation as 
an ordering scheme. In practice, we may wish to combine sixteen 
such patterns into two or three broad groups. Thus, if we inspect 
Table 1, we will see that the following manifest response patterns 
have proportions higher than .90 possessing the latent character: 


Response pattern p 
i ie wa .996 
++-+4+ 971 
ai 953 
Sete ae .940 
t++t+- .938 


From column 1 of Table 1 it will be seen that these five response 
patterns include 244 of the 1,000 individuals. Moreover, it will be 
seen that 521 of the 1,000 individuals have one of the following re- 
sponse patterns with values of p lower than .20: 


Response pattern p 
---+ 152 
+--- .148 

_ 117 
Sb .076 
- O11 


Now if we assume, provisionally, that the 244 with p > .90 are 
quite likely to have the latent character—call them group A—and 
the 521 cases with p < .20 are quite likely not to have the latent 
character—call them group C—we have two fairly pure extreme 
groups. The remaining 235 cases are indeterminate—call them 
group B.° 

® It happens that group A contains all who were positive on at least three items, while 
group C contains all who were positive on only one item or less Thus the grouping is 
the same as might have been arrived at by conventional item analysis. But we now 
know much more about the properties of the items than ordinary item analysis would 
have told us. Moreover, it is quite possible that a tabulation of the kind of Table 1 


would show that a pattern like — + + + may actually have a smaller proportion pos- 
sessing the latent character than a scale pattern like + - — +. 
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For practical use, either for the purpose of extracting the top or 
bottom class for special study or for the purpose of correlating 
groups A, B, and C with an outside criterion, such a combination of 
manifest scale patterns, guided by the p values, seems quite justi- 
fiable. Of course, taking a cutting point like p = .90, as in the 
above example, is quite arbitrary. Any other cutting points, like 
p = .95 or p = .67, might be chosen, depending on the degree of 
“purity” desired in the extreme group. Even if we took p = .50 
we would misclassify only about an eighth of our respondents. 

We must recognize that in the case of the perfect latent dichot- 
omy, with an indefinitely large number of manifest items, there are 
logically only two rank groups. Later, we shall describe latent 
structures which logically postulate more than two ranks. Neverthe- 
less, it would be shortsighted, at least in the light of our present 
knowledge, not to use information of the type provided by Table 1, 
column 5, to order respondents into more than two empirical rank 
groups—particularly when our aim is to segregate those persons who 
are highly likely to possess the latent character or not to possess it. 

The whole subject of the reliability of such groupings needs much 
more thorough theoretical and empirical study than it has been pos- 
sible to make at the time these volumes go to press. In Chapter 10 
Lazarsfeld has outlined the reliability problem and has suggested 
certain conditions which may justify expectations that, if the latent 
structure model is applicable, a relatively small number of items 
may be practically as serviceable as the much larger number of items 
conventionally employed in the absence of knowledge of the under- 
lying structure. 

For the ideal case, we can state analytically what is involved in 
the latent dichotomy approach by use of the following scheme for 
m items: 


Latent class Items 
frequency 1 2 3 4 sis m 
ny Pu Pi Pis piu cee Pim 
nit Pim Pir Pius Pi tee Pilm 


Let the total number of respondents be n. Let the number of re- 
spondents favorable on item 1 be m, on item 2, m, etc. Let the 
number jointly favorable on items 1 and 2 be m2, on items 1 and 3, 
M3, etc. Let the number jointly favorable on items 1, 2, and 3 be 
M2, On items 1, 2, 3, and 4 be niu, etc. 

We seek to classify the population into two latent classes, one 
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class with frequency 7; and one class with frequency m1 = n — nx. 
We need a set of 2m parameters pn, Pr, +++, Pim aNd Pin, Pin, ***, 
Pim The value of pn is the proportion, among all n; respondents, 
who are favorable on item 1. The value of pin is the proportion, 
among all n:: respondents, who are favorable on item 1, etc. 

We must have: 


mtnan=n 
mipu + mipin = 
MP2 + NUP = Ne 


NPim + N1u1Piim = Nn 


Now there must be no association between any two items among 
respondents belonging to a particular class. We have seen that, 
within class I, the proportion positive on item 1 is pn and the cor- 
responding proportion positive on item 2 is pr. Hence, if within 
class I, items 1 and 2 are to be independent, the proportion within 
class I who are jointly positive on items 1 and 2 must be pn-pn. 
Similarly, the proportion within class I who are jointly positive on 
items 1 and 3 must be pn-pis, etc. The same type of conditions 
must hold within class II—for example, the proportion within 
class II who are jointly positive on items 1 and 2 must be pin- pin. 
Hence all relationships must be of the type: nrpnpru + nipimpme 
=N12. 

Similarly, the proportions within class I of joint occurrence of 
positive references to items 1, 2, and 3 must be the product of the 
proportions within that class with positive responses to items 1, 2, 
and 3, that is, pn-pP12-p13; and analogously for all other joint occur- 
rences involving any number of items. 

The values of nz and ni and of the proportions pn, Pr, etc., must 
be calculated from the data. When the latent dichotomy model 
does not fit the observed data perfectly as has been indicated, it can 
be made to fit an adjusted set of data. For the adjusted set of fre- 
quencies illustrated in Table 1, the values of the calculated param- 
eters were: 


Latent class Items 

frequency 1 2 8 4 

m = 424.3 pu = 5125 pon = 5704 pi = 6276 pu = .9240 
nau = 575.7 pin = .0635 pi = .1008 pirs = .1871 ping = .4324 
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With these numerical values known one can compute straight- 
forwardly the values shown in columns 3 and 4 of Table 1, by pro- 
cedures shown in detail in Chapter 11.” 

This exposition has used a single numerical illustration to help the 
reader grasp the basic idea of latent structure analysis, namely, that 
all the association between any two items is due to the latent attri- 
bute, and that there is no association between the specific com- 
ponents of the items. For the formal mathematics the reader is 
referred to Chapter 10. For the detailed explanation of how the 
adjusted frequencies are obtained when an approximate solution is 
necessary, the reader is referred to Chapter 11. In that chapter, 
each of the numerical steps in constructing Table 1 is shown in full, 
and the student may wish to repeat the operations step by step in 
order to appreciate more fully the assumptions and implications. 

It should be said that certain of the problems of indeterminacy 
which haunt quantitative factor analysis appear in latent attribute 
analysis. Work with this new tool is still too young to have de- 
veloped a completely standardized set of criteria for determining 
when the approximations made are “good enough” approximations. 
Such standards doubtless will be forthcoming, but they wait upon a 
large amount of further empirical and theoretical investigation. 
For example, two items which seemed by a priori inspection of con- 
tent to be in the same general attitude area as the four items in our 
illustrative example were substituted for items 2 and 4. Although 
by conventional item analysis techniques these new items should 
belong in the scale, they did not satisfy the criteria for the latent 
dichotomy model. 

As was said at the beginning of this section, the latent dichotomy 
model is merely a special case of the more general theory of latent 
structure analysis. Instead of two latent classes we may have 
latent classes in the following scheme: 


Latent class Items 
frequency 1 2 8 4 oe m 
ny Pu Pr Pi Pi wee Pim 
nr Pur Pitz pus Pia see Pim 
113604 Put Pitz Pir Pris eee PUInm 
nh Pm Pra Pus Pu vee Pym 


%” For example, the number possessing the latent character among those with scale 
pattern + + + + in Table 1 is nipupupiupy = 71.9. 
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For the mathematical proofs one is referred to Chapter 10. The 
basic logic is exactly the same as in the latent dichotomy case, ex- 
cept that there are now ) latent classes instead of only two. The 
values of nz, ni1, ---, N, ald pn, Pre, etc. are so determined that all 
the association between the responses to any two of the original 
items can be accounted for by the latent classes and by these alone. 

It should be noted that unlike the latent dichotomy case, there 
is in the most general case no necessary intrinsic cue for ranking 
respondents. Only if the data possess certain special characteristics 
can a rank order be assigned to m1, m1, --- and 7, and hence a basis 
be provided for ultimately ranking respondents. 

One of these special cases is of very great interest to us, however, 
since it corresponds closely to the scalogram case described in the 
first section of this chapter. When \ = m + 1, we may find that 
the quasi scale in scalogram analysis can be described as follows: 


Latent class Items 

frequency 1 2 8 4 
m1 l-—a l—a l—a l~a@ ... l—a@n 
ni a l-—-a@a l—-—-a l—-agy ... 1 — dn 
Tau a a l—a l—a@m ... 1 — an 
Tm Q as as a 88 l-a, 
Thm 41 ay a a3 a Sere an 


Here pr = 1 — a; where a; is a quite small fraction; px = 1 — a, 
where a, is also a small fraction; etc." 

AS Q), G2, ---,@m tend to zero, we approach the conditions for a 
perfect scale in the sense of scalogram analysis, and at the limit we 
have: 


Latent class Items 

frequency 1 2 3 4 m 
n1 1 1 1 1 1 
ni 0 1 1 1 1 
nit 0 0 1 1 1 
Nn 0 0 0 0 as 1 
Nn $1 0 0 0 0 2 0 


The theoretical implications of the fact that scalogram analysis 
may be shown to constitute a special case of latent structure theory 
need much further study. 

1 Tt is not necessary that all a’s in a column, for example, be equal. But the special 


case in which they can be assumed to be very small and equal lends itself readily to 
computation because of the limited number of parameters involved. 
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In Chapter 11 a numerical example is worked out in detail ap- 
plying latent structure analysis to data which form a quasi scale by 
the scalogram model. These are data on psychosomatic complaints 
in a sample of 1,000 soldiers. Four items are used in this illustra- 
tion: 


1, Have you ever been bothered by pressure or pains in the head? 


2. Have you ever been bothered by shortness of breath when you were not 
exercising or working hard? 


3. Do your hands ever tremble enough to bother you? 
4. Do you often have trouble in getting to sleep or staying asleep? 


The values of p in the structure, based on response to these four 
items, are as follows: 


Latent class Items 
frequency 1 2 3 4 
nt .9640 .8703 8473 .9326 
nu .0360 .8703 8473 .9326 
ni .0360 .1297 8473 .9326 
Nr .0360 .1297 1527 .9326 
Ny .0360 .1297 1527 0674 


It is shown in Chapter 11 that when these values are used to dis- 
tribute among the five latent classes the respondents who have any 
given response pattern (such as + + + +) the results are very 
satisfactory. An intrinsic basis exists for ranking the response pat- 
terns and hence the respondents. Since approximations are in- 
volved, further study is needed as in the latent dichotomy case to 
set standards of acceptable approximations. 

The scalogram model requires, of course, that the items tend to be 
cumulative. This restriction is also necessary when the latent struc- 
ture analysis is applied to the scalogram picture as a special case. 
But there are other special cases of latent structure theory, which 
permit ordering of respondents without requiring the items to be 
cumulative. Consider the following picture: 


Latent class Items 
frequency 1 2 3 ee m 
m1 pu* Piz Dis vr Pim 
ni Pm pu* Pus we Pilm 


nut Pim pire purs* oes Pilim 


Rn Pm Pm2 Pm3 cre Pmn* 
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Here the items and the latent class frequencies are so arranged that 
the largest values of the p’s (denoted as p*) are on the diagonal. 
It is assumed that the values of pin, pin, «++, Pm diminish rapidly 
and monotonically from py,*. Similarly pis < pris < piiis* > Pivs 
> +++ Dm, etc. This would appear to represent the type of phe- 
nomena present when items have been ordered on a Thurstone-type 
equal-appearing interval scale. Because of difficulties involved in 
computing the parameters, no numerical example is as yet available. 
One approach to the problem would be to cut down the number of 
parameters by imposing further symmetrical restrictions. 

A particularly interesting special case of latent structure analysis 
is one of the cases where there are only three ordered latent classes. 
The algebra is such that arithmetic computation is comparatively 
simple, if we are willing to restrict the number of parameters in- 
volved in p to two per item. At the same time much flexibility in 
patterns of p values is possible. A given item could take on any of 
the following six patterns of p values: 


Latent class Value of p Latent class Value of p 
I a; + by I ay + by 
(1) II ay (4) II C7 + by 
il a Ill au 
I a2 I as + bs 
(2) I ae + bs (5) II as 
III a TI as + bs 
I as J ae 
(3) II as (6) II ag + be 
III a, +b; III as + be 


As long as parameters are limited to two per item, computation 
difficulties are not substantially increased by increasing the num- 
ber of latent classes. The values of a and b are free to vary from 
item to item and we are free to use any particular pattern or combi- 
nation of patterns with any number of items. Such a pattern is 
illustrated in Chapter 11 with a numerical example from Research 
Branch data on job satisfaction. 

A pattern such as one of the six above can be thought of as a dis- 
crete simplification of a continuous trace line p(x) which is the func- 
tion of a latent continuum z with an infinite number of classes. In 
introducing the concept of latent structure analysis, in Chapter 10, 
Lazarsfeld begins with the continuous case. He develops the theory 
for the continuous case and then considers a variety of special cases 
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in which there are a finite number of latent classes with varying 
patterns of p. Thus the six patterns given just above may be 
thought of as discrete approximations to curves of the second degree. 

Among the most important contributions of Chapters 10 and 11 
are preliminary tests enabling the investigator upon initial inspec- 
tion of his data to decide which of various special cases would pro- 
vide the more appropriate model for use on these particular data. 
These tests involve an examination of the first-order determinants 
formed from cross tabulation of each item with each other. The 
reader will find it particularly instructive to compare and contrast, 
in Chapter 11, the table of these determinants for the four items on 
attitude toward the Army which approximately fit a latent dichot- 
omy pattern (page 419), with the set of determinants for the four 
items on psychosomatic complaints which are part of a quasi scale 
by the scalogram model (page 443). Each has its own pattern of 
internal consistency and the two patterns are strikingly different. 

There are various additional theoretical and practical implica- 
tions of latent structure theory, as outlined in Chapters 10 and 11. 
Latent dichotomy analysis has an especially interesting application 
to the problem of shifts in attitude by the same respondents at two 
time points. In Chapter 11 a numerical illustration from Research 
Branch data is given to show how the method serves in the analysis 
of shifts in attitudes toward officers. 

As compared with the eight chapters devoted to scalogram theory 
the two chapters on latent structure analysis hardly do justice to 
the problems they raise. But the developments there reported are 
still too new to have associated with them the relatively large back- 
ground of experience which is reported in the other chapters. In 
particular, computation techniques are usually much more difficult 
than with scalogram analysis and need simplification. Approxi- 
mation criteria need further standardization—which is also true of 
some aspects of scalogram analysis. 

Whether or not the latent structure theory lends itself readily to a 
large variety of practical applications, its generality and its sug- 
gestiveness as a model for the reduction of qualitative data make it 
worthy of much further exploration. 


Ireoigy 
3. Utilizing Indexes of Intensity of Attitude to Locate 
a “Zero Point” or “Region of Indifference’ 


Thus far in this chapter we have previewed some of the contri- 
butions of this volume toward the problem of establishing among 
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respondents a rank order with respect to a particular attitude con- 
tinuum. 

In the last year or so of the Research Branch, progress also was 
made on the problem of establishing a ‘“‘zero point” or “‘region of 
indifference’ which would provide an objective basis, independent 
of particular question wording, for distinguishing relative propor- 
tions of pros and cons on an issue. 

It developed that one of the most interesting theoretical prop- 
erties of scalogram analysis was the fact that, in the ideal case of 
perfect scalability, the ranking of people with which we have thus 
far been concerned corresponds to but one of a set of principal com- 
ponents. In fact, as Louis Guttman proves in Chapter 9, if there 
are m response patterns or ranks there are m principal components. 
The conditions of perfect scalability, by scalogram analysis, lead to 
the result that each component score has a definite relation to the 
rank order of the people, and there is a definite law of formation. 

The first component is a monotonic increasing function of the 
ranks (a straight line in the special case when the frequencies are 
the same for each response pattern). It is this first component 
with which we have been concerned in the problem of ordering 
people on a content scale. 

The second component will always be a U-shaped or J-shaped 
curve with one bend in it. It need not be symmetrical, though in 
the special case where each response pattern has the same frequency 
it will be a symmetrical parabola. This second component has 
proved especially interesting, since it can be interpreted as a meas- 
ure of intensity, leading theoretically, in the ideal case, to the 
determination of an objective zero point, at the point where the 
curve of intensity is a minimum. 

The third principal component has two bends in it, the fourth 
three bends, and in general, the ith principal component has 7 — 1 
bends. As yet little progress beyond the speculative stage has been 
made in investigating the psychological implications of the higher 
components, except for the third, which Guttman calls closure. (See 
Chapter 9, page 313.) 

The first half of Chapter 9, in which Guttman introduces the 
theory of principal components, is written quite simply, for the non- 
mathematical reader. It is addressed to all psychologists and social 
scientists who are concerned with the problems of interpreting the 
socio-psychological implications of ordered structures. It will re- 
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pay the most careful study, as will, of course, the formal and rather 
difficult mathematical treatment in the latter part of the chapter. 

The general idea of the first and second component will now be 
illustrated with a simple hypothetical example which has been de- 
veloped more fully in Chapter 9. Consider the following scalogram, 
with all types of persons equally frequent and all types of item re- 
sponses equally frequent: 


Type of person Type of category 
(rank) Says yes to item Says no to item 
12 8 4 6 12 8 4 6 

5 KX X X X X 

4 X X X X ».« 

3 X X X X X 

2 xX X xX X X 

1 >.< X X X X 

0 xX X X X X 
Frequency of response 12 3 4 5 5 4 3 2 1 


Now we wish to assign a numerical score to each type of person 
and a numerical weight to each item response category. In the 
above example we want 6 person scores and 10 item weights. 

The scores for persons should satisfy the following condition: All 
people who fall in one category of an item should have scores as 
similar as possible among themselves, and as different as possible 
from the scores of people in another category of the item. The 
total variance of persons’ scores can be expressed as the sum of two 
parts: the variance of the scores within categories and the variance 
of scores between categories. Maximizing the similarities within 
categories and differences between categories implies maximizing 
the square of the correlation ratio." 

The weights for items should satisfy the following condition: All 
categories characterizing one person should have numerical weights 
as similar as possible, and as different as possible from the weights 
for categories which do not characterize that person. Maximizing 
similarities within people and differences between people calls for 
determining that set of numerical weights for all the item categor'es 
which will have the largest possible correlation ratio with respect to 
all the people. 

Guttman has found that the maximum possible correlation ratio 


13 The correlation ratio is the square root of the ratio that the variance of scores be- 
tween categories has to the total variance. 
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for scores is identical with the maximum possible correlation ratio 
for weights. He shows that the optimum score of a person is pro- 
portional to the arithmetic mean of the weights of the item re- 
sponse categories by which that person is characterized, and the 
optimum weight of a category is proportional to the arithmetic 
mean of the scores of the people who are in it. 

The problem becomes one of finding a set of scores which “go in 
a circle’’—that is, scores which yield weights that give back the 
same scores, differing from initial scores only by a constant of pro- 
portionality. Such a set of scores is called a principal component 
of the system. Let us now assign a set of scores to each rank type 
of person in the foregoing diagram as follows:” 


5 to each response by a person in rank type 5 
3 to each response by a person in rank type 4 
1 to each response by a person in rank type 3 
—1 to each response by a person in rank type 2 
—3 to each response by a person in rank type 1 
—65 to each response by a person in rank type 0 


Then we have the following table: 


Type of person Type of category Sum of Average 
(rank) Says yes to item Says no to item weights —_ score 
1 2 8 4 6& 1 2 8 4 6 
5 5 5 5 5. 5 25 5 
4 3 3 3. 3 3 15 3 
3 1 1: 1 1 1 5 1 
2 -1 -1 -1 -1-1 —5 -il 
1 -—3 —3 -3 -3 -—3 —15 -—3 
0 -§ -5 -5 -5 -5 -—25 —5 
Sum of weights 5 8 9 8 5 —5 —8 -—9 —8 —5 
Average weight 5 4 3 2 1 —-1 -2 -3 —4 -5 


Now let us use the bottom row of numbers—the average weight for 
each item category—to compute a new score for each type of person. 
(For example, for a person with rank type 5 the weights 5 + 4+ 3 
+2 +1 = 15 or an average score of 3.) For the whole table we 
have: 

us These scores are not arbitrary, but are the only set of scores (except for multiples 
thereof) which will satisfy the requirements of circularity for the first principal com- 


ponent. For details as to how these scores may be derived, the reader is referred to 
Chapter 9. 
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Type of person Type of category Sum of Average 
Says yes to tlem: Says no to item weights score 
1 2 8 4 6& 1 2 8 4 6 

5 5 4 3 2 1 15 3.0 

4 4 3 2 1 -1 9 1.8 

3 3 2 1 —-1 -—2 3 6 

2 2 1 -1 -—2 -3 —3 —.6 

1 1 —-1 —2 -3 -—4 —9 -18 

0 -1 —-2 -3 -4-5 —-—15 —3.0 


Let us now compare the average score just obtained with that ob- 
tained in our initial trial: 


(21) (ys) 


Type of Average score on Average score on 
person initial trial second trial 
(circular score) 

5 5 3.0 

4 3 1.8 

3 1 6 

2 -1 —6 

1 -—3 —18 

0 —5 —3.0 


We see that the new sets of scores for types of persons is a linear 
function of the scores on our initial trial, namely y1 = .62;. A con- 
tinuation of this process will always lead to new scores which are 
exactly .6 times the previous scores. The set of 2: scores we call 
the first principal component. 

Now let us postulate and assign a different set of scores which also 
will be seen to go in a circle. The scores will be 5, —1, —4, —4, 
—1, 5. The ranks of these scores are no longer according to the 
scale rank order. Instead the scale rank score is ‘‘folded over.” 
We have: 


Type of person Says yes to item Says no to item Sum of Average 
1 2 4 6 1 2 3 4 & weights score 
5 5 5 5 5 5 25 5 
4 —-1 -1 —1 -1 -l —5 -1 
3 —4 -4 -4 —-4 -4 —20 —4 
2 —4 -4 —4 -4 -4 —20 —4 
1 -1 -1-1-1-1 —5 -—1 
0 5 56 5 5 5 25 5 
Sum of weights 5 4 0 -—4 —5 -5 -4 0 4 5 


Average weight 5 2 0-1-1 —- -1 0 2 65 
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Entering the average weights from the bottom row, we have on our 
second trial: 


Type of person Says yes to item Says no to item Sum of Average 
2 8 4 6 1 2 8 4 & weights score 

5 5 2 0-1-1 5 1.0 

4 2 0-1-1 —1 -1 —.2 

3 0-1 —1 —-1 —1 —4 — 8 

2 -—1 -1 -1-1 0 —4 —8 

1 -1 -1-1 0 2 -1 —.2 

0 -1-1 0 2 5 5 1.0 


Comparing the average scores thus obtained with that which was 
obtained in our initial trial, we have: 


(x2) (ys) 
Type of Average score on Average score on 
person initial trial second trial 
(circular score) 

5 5 1.0 

4 -1 —.2 

3 —4 — 8 

2 —4 —.8 

1 -1 —.2 

0 5 1.0 


We see that the new set of scores for the type of persons is a linear 
function of the scores on our initial trial, namely, y2 = .222. A con- 
tinuation of the process will always lead to new weights exactly .2 
times the previous scores. The 22 scores constitute the second 
principal component. 

What we have been considering are only two of the principal com- 
ponents. Further analysis will reveal in the present example five 
principal components, that is, five sets of scores which have the 
property of circularity exhibited above. 


Type of person Principal component scores 
I II III IV 4 
5 5 5 5 1 1 
4 3 —1 -7 —3 —5 
3 1 —4 —4 2 10 
2 -1 —4 4 2 —10 
1 —3 -1 7 —3 5 
0 —5 5 -—5 1 -1 
Constant of proportionality 6 2 al 06 .04 


It will be observed that the constants of proportionality (ob- 
tained through each respective circular operation) diminish with 
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each component. Since each constant of proportionality is, in fact, 
the square of the correlation ratio of a given component on the items, 
this means that each successive component accounts for a diminish- 
ing amount of the total variance in item categories. It also will be 
observed that when we add the five constants of proportionality we 
have 


6+ .2+4+.14 06+ .04=1 


that is, the sum of the squares of the correlation ratios is unity. 

Principal components are, as Guttman observes, important con- 
cepts in mathematics and physics. Even though each item in- 
volves the same single factor as any other item, it differs in how it 
combines the components. ‘Each item represents,” in Guttman’s 
words, ‘‘a different variation on the same theme. Another way of 
expressing this is that each item represents a different combination 
of the tones and overtones of the same basic note.” ™ 

In Chapter 9, Guttman presents a detailed mathematical treat- 
ment of the problem and derives a number of further interesting 
properties of these functions. For example, he shows that though 
each principal component, in the ideal case, is a perfect curvilinear 
function of the rank order, the linear correlation between the prin- 
cipal components is zero." He suggests that the comparatively 
simple law of formation as to their oscillations may prove to make 
the scalogram model of special theoretical importance among all the 
various models for ordering qualitative data. For scales which do 
not satisfy the scalogram criteria, the principal components, though 
always linearly uncorrelated as in the scalable case, have no simple 
pattern of dependencies and can, in fact, be completely independent. 
In general, there appears to be no thing, such as exists in the scal- 
able case, as an inevitable U-shaped or J-shaped second component 
which might be identified as intensity, nor a third component which 
always has two bends, ete. 

How important this theory of principal components will become, 
in enhancing the utility of scalogram theory as a tool for future in- 
vestigation, it is much too early to say. It must be remembered 
that Chapter 9 deals with the ideal case, and perfect scalability by 

44 Other names which have been given in the literature to principal components in- 
clude principal axes, latent vectors, characteristic functions, characteristic vectors, 
eigenfunctions, and eigenvectors. 

16 The reader can verify this in our simple numerical example above for any two com- 


ponents. For example, for II and IV we get (5-1) + (—1-—3) + (—4-2) + (—4-2) 
+ (—1-—3) + (5-1) = 0. 
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the scalogram theory or any other theory is not likely to be found 
in practice. 

Some months passed after Guttman’s discovery of the principal 
components mathematically before a psychological interpretation 
of the second component was realized. The clue came from a paper 
by Daniel C. Katz in Hadley Cantril (editor), Gauging Public Opin- 
ton. Practical procedures for studying the intensity function were 
devised jointly with Edward A. Suchman, and it was Suchman who 
supervised the empirical research. 

This empirical work is reviewed in Chapter 7, with several fully 
worked-out examples with Research Branch data. A few examples 
of intensity analysis were carried out in time to find a place in Re- 
search Branch studies which were not merely methodological. 
Illustrations appear in Volume I, Chapters 5 and 8 of the present 
series of books. 

To find the intensity function it is necessary to develop, inde- 
pendently of the scale of content, a scale of intensity with which re- 
spondents felt about a subject. Two techniques were developed to 
arrive at such a scale of intensity. One was to follow each content 
item with a second item more or less like the following: 


How strongly do you feel about this? 
Not at all strongly 

Not so strongly 

Fairly strongly 

_____ Very strongly 


Irrespective of the direction, pro or con, of the response.on the pre- 
ceding attitude item, a given intensity response, along with other 
intensity responses which were preceded by other attitude items, 
was used to form a scale of intensity. In no instance did the in- 
tensity items form a scale satisfying the criteria of scalogram 
analysis but they did ordinarily form a quasi scale. 

An alternative method, which is called the fold-over technique, 
sought to derive the content and intensity scales from the same set 
of questions. For example, consider the item: 


Do you think when you are discharged you will go back to civilian life 
with a favorable or unfavorable attitude toward the Army? 
Very favorable 
___. Fairly favorable 
—______ About 50-50 
Fairly unfavorable 
____ Very unfavorable 
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Such an item, as one of a set of items in an attitude scale, was now 
scored twice. First, it was scored for direction of content, in the 
usual scalogram manner. Second, it was scored for intensity—for 
example, the respondents who checked either top or bottom cate- 
gories might be given a plus score and those who checked any of the 
three middle categories might be given a minus score. Intensity 
scores from such items were then combined to form a scale— 
actually, a quasi scale—of intensity. The fold-over technique was 
attractive, because it reduced the length of the questionnaire and 
relieved the respondent of a certain tedium. In a few studies it 
yielded about the same results as the longer method. But in other 
studies, it was less satisfactory. While the shorter procedure needs 
further study, the present disposition is to recommend the longer 
form wherever it is possible. 

In Chapter 7 several numerical examples are provided of the 
empirical relationship between curves of intensity and scales of con- 
tent for a variety of Army subject matter. In general, as predicted, 
a J-shaped or U-shaped curve was obtained with attitude data. 

In the ideal case, the minimum point on the curve of intensity 
marks the zero point of the content scale. Empirically, however, 
there were a number of instances where the base of the U was broad 
and nearly flat, extending over several ranks. This phenomenon 
would seem to indicate a relatively broad ‘‘zone of indifference” and 
would make the selection of any single point as a minimum point 
somewhat hazardous, since the one point slightly lower than others 
might be so by chance. The sampling error of the minimum point 
has not been worked out and may not lend itself to a ready solution. 
Pending the development of better methods for designating a single 
point as a zero point in instances where the “‘zone of relative indif- 
ference” is broad, without a sharply defined minimum, one should 
treat the “zero point’? with considerable caution. One procedure 
which has been suggested is to establish a “zone of indifference” 
arbitrarily by finding that rank at which each arm of the U-shaped 
curve has, say, median intensity. If a larger proportion of people 
with greater than median intensity are favorable than are unfavor- 
able, this can be reported as a fact, along with the relative propor- 
tion favorable or unfavorable on either side of the ‘“‘zero point.” ' 

A crucial property of any satisfactory procedure for determining a 


18 This procedure could, of course, produce two contradictory conclusions. If so, 
and if the “zero point” is not sharply defined, the only safe conclusion may then be that 
the “zone of indifference” is broad, without specification of proportions favorable or 
unfavorable. 
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pro and con cutting point must be its independence of particular 
item wording. 

In Chapter 7 it is reported, with numerical examples, that the 
methods there described seemed to have met this test quite satis- 
factorily. For example, a sample of six items from a scale of atti- 
tudes toward officers was used in which the proportion favorable on 
individual questions ranged from 8 to 43 per cent. This was called 
an unfavorably biased sample of items. Another sample of six items 
from a scale of attitudes toward officers was used in which the pro- 
portions favorable on individual questions ranged from 58 to 90 per 
cent. This was called a favorably biased sample of items. The re- 
spondents were the same on the two sets of items. It should be 
noted, parenthetically, that if one conventional public opinion poll- 
ing agency had happened to use the first set of questions, whereas a 
second agency had happened to use the second set of questions, they 
would have come to opposite conclusions about the average propor- 
tion of men favorable to their officers. At least, they would have 
done so in the absence of some technique, which was independent of 
particular item frequency, for determining the proportions pro and 
con. For each of the two sets of items an intensity analysis was 
carried out. As is shown in detail in Chapter 7, approximately 80 
per cent of the men were on the unfavorable side of the minimum in- 
tensity point, irrespective of whether the favorably biased or unfavorably 
biased items were used. 

Further experimental work, to test the independence of the ‘‘zero 
point”’ or ‘region of indifference” from particular question wording, 
is needed on a wider variety of problems than the Research Branch 
had time to study in this way. If past experience continues to stand 
up in practice, there can be hardly any room for doubt about the 
utility of this device. 

An interesting by-product of the empirical study of intensity func- 
tions has been the exploration of a concept of generalized intensity. 
This study came about through an effort to account for the rather 
wide dispersion of scores observed around intensity curves at all 
intervals. 

Can it be, it was asked, that, irrespective of the particular attitude 
continuum under investigation, some people habitually are more 
vigorous in their responses than others? If so, their intensity of 
response to a set of items on a particular attitude continuum might 
need to be discounted. That is, the fact that they responded with 
apparent intensity to these attitude items might merely reflect gen- 
eral verbal habits and not necessarily strong feeling about the atti- 
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tude in question. The effect would be to increase the variability of 
responses around the intensity function in this or any other attitude 
area. 

As described in Chapter 7, the problem was studied by construct- 
ing a quasi scale of generalized intensity, based on intensity of feel- 
ing expressed about a very wide variety of problems. The re- 
spondents were then broken into three groups, those with high, 
medium, and low generalized intensity, respectively. Selecting, for 
example, attitudes of men toward the Women’s Army Corps 
(WAC), one computed intensity functions separately for these 
three groups of respondents. Each intensity function represented 
the relation between the content score of attitude toward the WAC 
and the specific intensity score belonging to this specific attitude 
area. The three functions had the same shape and very nearly the 
same minimum point, but the three functions formed a nest, with 
that for the men with high generalized intensity at the top and that 
for the men with low generalized intensity at the bottom. (See 
Chart I.) Necessarily, the variability around each of these three 
functions considered separately was less than around a single func- 
tion formed by pooling all the cases irrespective of generalized in- 
tensity. 

This type of exploration has important implications for attitude 
research in general and public opinion polling in particular, for it 
‘shows how responses to items involve mere verbal habits of expres- 
sion which can lead to especially misleading results on individual 
items, as well as on scales. One person can say he feels very strongly 
opposed to something and yet be no more strongly opposed than an- 
other person who says that he feels somewhat opposed, the former 
being a person given to generally strong verbal habits and the latter 
to generally moderate verbal habits. The illustration given in the 
preceding paragraph showed why, if generalized intensity is not 
controlled, the variability around the specific intensity function will 
be rather large. It also gave encouraging intimations—which need 
to be tested on a very much wider variety of data—that even if the 
variability is considerable, the methods developed in the Research 
Branch will tend to yield a consistent ‘‘zero point” or ‘region of 
indifference” whatever the generalized level of intensity. 

The implications of the theory of principal components and its 
empirical applications—especially, the second principal component, 
which in the ideal case is interpreted as the intensity function—need 
much further study. The ideas are not yet widely enough known 
to have received the kind of discerning criticism and new explora-~ 
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tion which is advancing understanding of those aspects of scalogram 
theory described in the first section of this chapter. 


CHART I 


Sreeciric INTENsITY FoNcrions FoR RESPONDENTS aT THREE LEVELS OF 
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Concluding Comments 


The next ten chapters seek to present, as compactly as possible, 
due to present high printing costs, sufficient mathematical proof and 
numerical illustration to give the reader a good beginning in the 
understanding of the kind of thinking represented in the new ap- 
proaches growing out of the work of the Research Branch. 

Chapters 2 to 9 describe various aspects of scalogram analysis 
and Chapters 10 and 11 describe latent structure analysis. Most 
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of the chapters have been written primarily for the nonmathemati- 
cian, the exceptions being parts of Chapters 9 and 10. All the 
illustrative numerical examples are from Research Branch data. 

Each chapter is systematically organized and the attempt has 
been made to present the ideas as lucidly as possible consistent with 
rigor. Yet the reader is warned that these pages will require study 
and in places may be somewhat difficult, even where not primarily 
mathematical. Each chapter, of course, represents the point of 
view of its author and only a minimum editorial effort has been 
made to reconcile differences in terminology and in point of view, 
especially as between chapters written by Guttman and by Lazars- 
feld, respectively. 

In conclusion, it may be well to reiterate the point made at the 
beginning of this chapter, namely, that the purpose of this research 
is to throw new light on the problem of constructing theoretical models 
of ordered structures or scales and testing the applicalnlity of a particu- 
lar model to a particular set of qualitative data. Once we have found 
an appropriate model, we are in a position to operate with a rela- 
tively small number of items to describe a particular dimension, if 
the area is scalable by one or another rigorous criterion. Such short 
scales will be economical to use. When, in a given broad attitude 
region, such a set of quite specific short scales has been found, we 
should then be in a position to proceed with a multidimensional 
study of the area, testing theories as to the interrelationships of the 
scales used. This study may require us to try to reduce the num- 
ber of these initial dimensions by some form of quantitative factor 
analysis. But the first step is to have a set of clean unidimensional 
scales and it is toward the attainment of this first step that Research 
Branch methodological energies were mainly directed. 

As must be apparent from the preview presented in the present 
chapter, there is still relatively little which has sufficiently passed 
out of the realm of controversy to reach a definitive textbook stage. 
In the history of science, only a small fraction of the proposed scien- 
tific models, even including many which have certain initial attrac- 
tions, find a permanent place. As more and more of the younger 
psychologists and social scientists, aware of the central importance 
of the problems here investigated, put their minds to investigation 
of these problems, we may expect models to take shape which will 
hold their place in science. Among those models may be some 
whose development has been furthered by the Research Branch 
and by the discussion, criticism, and creative inventiveness which 
it is hoped these chapters will stimulate. 


CHAPTER 2 


THE PROBLEM OF ATTITUDE AND 
OPINION MEASUREMENT ' 


he two problems. Two of the most important problems of all 

measurement concern (1) the determination of unidimension- 
ality and (2) the determination of a fixed point of reference along 
such a single dimension. Measurements to be meaningful should 
be along only one dimension at a time. Once a single dimension 
has been isolated, it often becomes important to find some fixed 
point to which measures along this single continuum can be an- 
chored. 

The social sciences have long been concerned with both of these 
problems. Various techniques have been proposed for the determi- 
nation of a single continuum, but, as will be discussed later, most 
existing tests for unidimensionality do not appear to be based upon 
any clear-cut rationale. With regard to the second problem of a 
fixed point of reference, little progress seems to have been made. 

Our purpose is to offer a new theory and its concomitant tech- 
niques in answer to the two problems. The technique of scalogram 
analysis® provides a simple method for testing a series of qualitative 
items for unidimensionality, while the technique of the intensity 
function provides a simple method for finding a meaningful, objec- 
tive cutting point along a single continuum. 

The next chapters will discuss the problem of scalogram analysis, 
presenting in detail the basic theoretical foundations for scalogram 
analysis, a detailed outline of the scalogram board technique, illus- 
trations of the utility of scale analysis in different problems, a dis- 
cussion of the problems of reliability and validity for the present 
theory of scale analysis, and finally an evaluation of the present 


1 By Louis Guttman. 

2? Throughout the remainder of the chapters in this section, the terms scalogram 
analysis and scale analysis will be used interchangeably. The phrase “scale analysis” 
will refer only to the present technique of scalogram analysis. 
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theory of scale analysis as compared to existing methods for testing 
data to see if they lie along a single continuum. 

After this discussion of scale analysis, the theoretical basis for the 
intensity function will be presented, followed by several illustrations 
of intensity curves and a demonstration of the independence of this 
curve and its cutting point from specific question wordings. Finally, 
the general equations of the components of scales will be derived, 
which in particular include the intensity component. 

Scale analysis is formal. The problems of dimensionality and of 
an unbiased cutting point are of crucial importance in attitude and 
opinion research. Therefore, the present introductory chapter is 
devoted largely to a discussion of the definition and uses of atti- 
tude and opinion. The reader who wishes to proceed immediately 
to the theory and practice of scale and intensity analysis can omit 
the present chapter. The formal requirements for a scale hold re- 
gardless of what it is that is being studied, so that the principles of 
scale analysis hold in particular for any approach to attitude and 
opinion research. 

Altitudes versus opinions? The fields of attitudes and opinions 
have often been thought of as presenting different problems in 
definition and measurement.*? Differences in definition have been 
characterized by such terms as “controversial” (opinions) versus 
‘noncontroversial” (attitudes), or ‘‘expressed” (opinions) versus 
“underlying” (attitudes). Differences in measurement have been 

* There are several comprehensive summaries of attitude and opinion research to 
which the reader can refer for a more extensive treatment than is given here, especially 
in regard to specific surveys. Murphy, Murphy, and Newcomb present an excellent 
summary of specific surveys up to 1937; Schmeidler, Allport, and Veltfort summarize 
surveys during the war years; Sherif and Cantril present a detailed analysis and sum- 
mary of studies on the formation and change of attitudes. Review of methodology 
may be found in Riker, Cantril, and especially in McNemar. This latter review and 
summary might well serve as an introduction to the problems studied in the present 
research. 

G. Murphy, L. B. Murphy, and T. M. Newcomb, Experimental Social Psychology 
(Harper & Bros., New York, 1937). 

G. W. Allport and H. R. Veltfort, “Social Psychology and the Civilian War Effort,” 
Journal of Social Psychology, S.P.S.S.I. Bulletin, Vol. 18 (1943), pp. 165-233; G. R. 
Schmeidler and G. W. Allport, ‘Social Psychology and the Civilian War Effort: May 
1943-May 1944,” ibid., Vol. 20 (1944), pp. 145-180. 

Muzafer Sherif and Hadley Cantril, The Psychology of Ego-Involvements (John Wiley 
& Sons, Inc., New York, 1947). Also, Sherif and Cantril, ‘The Psychology of ‘Atti- 
tudes’: Part 11,”’ Psychological Review, Vol. 53, No. 1 (1946), pp. 1-25. 

B. L. Riker, ‘A Comparison of Methods Used in Attitude Research,”’ Journal of 
Abnormal and Social Psychology, Vol. 39, No. 1 (January 1944), pp. 24-42. 

Hadley Cantril (editor), Gauging Public Opinion (Princeton University Press, Prince- 
ton, N.J., 1944). 

Quinn McNemar, “Opinion-Attitude Methodology,” Psychological Bulletin, Vol. 43, 
No. 4 (July 1946), pp. 289-374. 
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described in terms of “polling”? (opinions) versus “scaling” (atti- 
tudes), or ‘‘dividing” (opinions) versus ‘‘ranking’’ (attitudes). 

It seems generally agreed that ‘“‘opinion’”’ should refer only to ver- 
bal behavior, whereas an “attitude”? may be either verbal or non- 
verbal. If we look at the research carried out on attitudes in the 
past, however, we find it is based largely on verbal behavior. There 
seems little point, then, in trying to distinguish between attitudes 
and opinions on this basis. We shall not attempt to define “opin- 
ion” as a distinct concept, but rather use the word interchangeably 
with “attitude”? when dealing with verbal behavior. We shall use 
the word “attitude” to include “opinions” and also nonverbal be- 
havior. When we use the phrase “attitude and opinion,” this will 
be only to emphasize that both verbal and nonverbal character- 
istics are under consideration. 

Polling versus scaling? The historical development of public opin- 
ion polls has followed somewhat different lines from what is ordi- 
narily considered attitude research. Differences in particular tech- 
niques used by polling agencies from those developed for the general 
field of attitudes seem to have given rise to the impression in some 
quarters that opinion polls have different basic methodological prob- 
lems from those of the general field. 

Perhaps the outstanding difference is that opinion pollsters ask 
but a single question on a given issue, instead of a battery of ques- 
tions as used generally in an attitude study. This is because poll- 
sters are ordinarily interested in population percentages rather than 
in scoring each individual. Opinion polls cast their results into 
statements like ‘75 per cent of the population is favorable,” or ‘35 
per cent of the population prefers this.”” A more complete attitude 
study would want to say, “Person A is more favorable than Person 
B who in turn is more favorable than Person C. . . .” 

If several questions are asked, then it is natural to inquire as to 
whether or not they are all tapping the same thing. Hence, the 
general concern of attitude students with the problem of scalability 
or dimensions of the responses. Is there a rank order for the people? 

If but a single question is asked, then it is quite easy to succumb 
to forgetting about the problem of dimensions, but to worry pri- 
marily only about a cutting point. Is the question unbiased in its 
division of people into ‘favorable’ and ‘‘unfavorable?”’ 

A single question cannot provide any data to test for dimension- 
ality, but this should not mislead one into believing that therefore 
the problem does not exist. If there is more than one dimension of 
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content in the question, what meaning can a cutting point have? 
On what is it that the population is ‘“favorable’’ or “unfavorable?” 

The problem of bias cannot really be tackled until it has been 
established whether or not a single dimension is present in the first 
place.* Hence, the problem of bias probably cannot be solved by 
the use of only a single question. Public opinion polling is subject 
to the same basic problem of scalability as is the general field of 
attitude research. 

In view of the foregoing considerations, it does not seem profitable 
methodologically to distinguish between opinion polling and other 
attitude research. The need for investigating dimensionality of 
responses remains in either case. Let us now turn to a closer view 
of the concept of attitude itself. 

Definition in terms of observations. In this section is proposed an 
approach to a definition of attitude that is intended to promote re- 
search. A complete definition 1s not attempted. Only a necessary 
condition to be incorporated into a definition is suggested, namely, 
that an attitude be defined in terms of a delimited totality of be- 
havior. Specifying behavior as a necessary condition can aid useful 
research without waiting for a complete definition. 

A basic premise accepted here is that a scientific concept must be 
defined in terms of observations; it may be defined directly in terms 
of the observations, or by operations on the observations. A second 
premise is that a definition is scientifically useful only in so far as it 
leads to objective research. According to these premises, any 
sociological concept must be based on observations of human be- 
havior and will be useful only to the extent that the requisite obser- 
vations can be made and analyzed rigorously. In particular, these 
considerations pertain to the concept “attitude.” 

Many authors have proposed approaches to defining an attitude.® 
Two notions that seem most common to their discussions are that 
(a) an attitude is a predisposition to act in a certain way toward 
something (subject-object relationship), and that (b) it is an infer- 
ence from previous behavior. 

Let us examine these notions in the light of our basic premises. 
In order for them to provide a complete definition, the predisposition 
and the inference must be pinned down. The behavior to be pre- 


‘ This undoubtedly accounts for the unrewarding results of the numerous subjective 
suggestions proposed by many writers on how to word questions so as to make them 
“unbiased.” 

* See summaries in the sources referred to in footnote 3. 
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dicted, the behavior from which inferences are to be drawn, and the 
process of inferring the predisposition must each be defined. 

Prediction not sufficient. Merely to say that an attitude is that 
which helps predict certain behavior, of course, is not sufficient to 
define the term as conventionally used. It can be predicted that a 
person will behave in a certain manner, yet it would be agreed that 
the basis for the prediction should not be called attitudinal. For 
example, from a knowledge of the amount of education an enlisted 
man has had, we can predict to a certain extent how often he will 
go AWOL. But we would not say that a man’s education should 
be an integral part of a definition of attitude toward Army regula- 
tions; it is a correlate and not a component. Relatively accurate 
predictions can be made of human behavior without direct reference 
to attitudes, and hence are insufficient to define an attitude. 

Useful research on prediction requires that the predictor and the 
thing to be predicted must be defined separately and independently. 
The research problem is then the empirical one of finding out how 
well the predictor actually predicts one or more other variables. 

Therefore, although how well an attitude can predict one or more 
other things will be of interest, the attitude cannot be defined merely 
as something that predicts. Two objections to such an approach 
are that (a) nonattitudinal variables also predict; and (b) to regard 
empirical correlates of an attitude as part of its definition makes 
nonsense of research. Prediction or correlation is not sufficient to 
define an attitude. 

Similarly, prediction does not seem to be a necessary component 
of a formal definition of attitude. It may be that when attitudes 
are defined, they will yield useful predictions for some problems and 
less useful predictions for others. But if prediction is to be incor- 
porated into the formal definition, then it must be stated how ac- 
curate itis. Must the prediction be perfect? If not perfect, and if 
predictability is measurable by correlation coefficients, is a corre- 
lation of .8 sufficient, or is .5, or is .01? 

Perhaps even more crucially, if prediction is to be part of the 
definition, then the thing to be predicted—the criterion—must be 
defined. Shall the attitude of a person toward a candidate for elec- 
tion have as a criterion the voting behavior of that person on elec- 
tion day? Then those who are under-age, illiterate, or cannot vote 
for any other reason constitute a problem: do they or do they not 
have an attitude toward the candidate? 

It seems implicit in conventional discussions of attitude as a pre- 
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disposition that behavior depends not only on the attitude but also 
on the situation. A person may hate his boss, but smile at him 
every day. Conceivably, then, attitudes may fail completely to 
predict some criteria. 

If a definition of attitude could be made which involved prediction 
as a component, and which would facilitate research, this would be 
highly desirable. We believe that this has not yet been done, and 
that the problems raised in the preceding discussion militate against 
its being done. Attempting to define a predictor in terms of criteria 
to be predicted seems to lead to ambiguities and circularities. An 
alternative approach is to define attitude apart from any one cri- 
terion, and to regard prediction as an empirical correlate, rather 
than as acomponent. That is, after an attitude has been defined, 
the problem of prediction can then be tackled in a straightforward 
manner; the empirical correlations of the attitude with any outside 
variable desired can be ascertained. The attitude will be found to 
predict some things well in some situations, and some things poorly 
in some situations. In this way, many ambiguities are avoided. 

We believe that in the long run better predictions will be made on 
the basis of research using such an approach. Preliminary evidence 
that predictions are facilitated by focusing on defining things in 
their own right is afforded by scalogram analysis (which is discussed 
in detail in the following chapters). In scalogram analysis, the uni- 
verse of content is defined apart from any ideas of predicting outside 
variables. Then, if the content is scalable, it turns out that any 
outside variable whatsoever can have its maximum predictability 
determined in a very easy fashion. 

The concept of an attitude universe. How can an attitude be de- 
fined without explicitly using the notion of prediction? An ap- 
proach that seems to have possibilities of yielding a rigorous and 
useful definition is to consider an attitude to be a delimited totality 
of behavior with respect to something. For example, the attitude of a 
person toward Negroes could be said to be the totality of acts that 
a person has performed with respect to Negroes. A discussion of 
the problem of how to delimit such a totality is reserved for later. 

Such an approach conforms to our premise for a definition by 
being based on observations. For the example of attitude toward 
Negroes, people can be observed with respect to how they behave 
toward Negroes in various kinds of situations. A person who votes 
against providing educational facilities for Negroes, sneers at 
Negroes when he sees them well dressed, speaks of them in deroga- 
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tory terms whenever he does talk about them, etc., could be classi- 
fied as having one kind of attitude toward Negroes, and a person 
doing other things could be classified as having another kind of 
attitude toward Negroes.® 

The utility of defining an attitude as a delimited totality of be- 
havior depends on how complicated are the interrelationships be- 
tween the various acts involved. It must be possible in practice to 
characterize the entire population whose attitude is being investi- 
gated on all the behavior involved in the definition. It should be 
possible to accomplish this either by (a) actual observation of each 
individual on each act; or (b) by finding that the interrelations between 
the acts are such that from only a sample of acts the whole configuration 
can be reproduced. If the number of acts is indefinitely large, a 
definition of attitude as a totality of acts can be useful only if the 
acts have a sufficiently simple pattern of interrelationships which 
will enable the whole to be reproduced fairly accurately from but a 
part. A scalogram pattern is one example of such a simple pattern 
of interrelationships. 

One way of arriving at a workable definition in this sense is not to 
attempt to bite off too much in one chunk. For example, if a com- 
plete study of all behavior toward Negroes could be made, it might 
be found that this configuration was far too complex to be manage- 
able as a whole, but that on the other hand, parts of the universe 
could be defined separately in a manageable form. The behavior 
of people toward Negroes when they meet them on the street might 
be proved to be a manageable configuration, or the behavior of 
people with respect to employing Negroes might be proved to be a 
manageable configuration; it would then be useful and profitable to 
speak of various attitudes: attitude toward Negroes when meeting 
them on the street, attitude toward employing Negroes, etc. The 
total attitude toward Negroes would then be the configuration of 
these subuniverses. A specific research project would rarely call 
for such a totality, but more often only for selected subuniverses. 

Classifications and subclassifications. A totality or universe of 
behavior can usually be regarded as a subuniverse of a larger uni- 
verse, and can itself often be divided into subuniverses. 

A subarea of behavior is itself of interest if the larger area of which 
it is a part is of interest. Furthermore, the relationships of a sub- 
area to outside variables can be studied and be useful regardless of 


* Note the avoidance here of saying that one person is “more” favorable than another 
toward Negroes; it is not assumed that attitudes are necessarily scalable. 
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the role of the subarea in the total area. In an external prediction 
problem, if an adequate prediction can be made from only a sub- 
area of an attitude, then that may be all that is needed for practical 
purposes; it need not be necessary to study the entire attitude. 

By recognizing the different subuniverses that can be in an atti- 
tude, one can avoid the temptation of saying that a particular sub- 
universe is “really” the attitude. For example, it has been found 
that Negro soldiers will respond differently to questions put by 
Negro interviewers than to the same questions put by white inter- 
viewers. Instead of asking which situation elicits the “real” atti- 
tude of the respondents, it seems far more profitable to recognize 
that here are two distinct, though related, subuniverses. If it is 
desired to predict something external from these responses, it may 
be found that the responses involving the Negro interviewer will 
help predict how a respondent will act in the company of Negroes 
with respect to the issue at hand; whereas, the responses elicited by 
white interviewers may better predict how the Negro will behave 
in a white environment. And most certainly, knowledge of the 
behavior in both the white and Negro interview situations will in 
general provide better external predictions than will knowledge of 
only one of the subuniverses. 

The role of questionnaire research. An interesting feature of cur- 
rent research is the use of questions in stimulating responses. The 
behavior performed by a respondent on a written questionnaire is 
either to place a check mark opposite a category of the printed an- 
swers to a printed question, or to write out a response by himself. 
In an interview situation, this writing is done by the interviewer 
himself. If the content of the questions is, for example, expression 
of opinion about the respondents’ officers, then the responses are 
classifiable as a subuniverse of the men’s attitude toward their offi- 
cers. 

Such a subuniverse of behavior of responding to a questionnaire 
is, of course, not the same as other behavior with respect to officers; 
and the relationships it has with other subuniverses of the attitude 
is a matter for empirical investigation, as is any question of inter- 
relationship between subareas. 

Questionnaire responses at present are by far the most manage- 
able kind of subuniverse of attitudinal behavior. If the attitudes 
of a population are under investigation and if it is not possible to 
observe the entire population on the totality of acts, then there are 
two kinds of sampling problems involved. One is the sampling of 
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acts, and the other is the sampling of people. Although the uni- 
verse of verbal behavior observable by the questionnaire technique 
is indefinitely large, an adequate sample can probably be incor- 
porated into an administrable questionnaire, and a proper sample 
of people can also be obtained.’ Many items can be included in a 
questionnaire and responded to in a very short period of time, 
whereas other kinds of subuniverses of behavior may take place 
over long periods of time and be difficult to observe systematically 
for a satisfactorily large group of people. 

Like any other subuniverse, questionnaire responses can be used 
to predict outside variables if desired, the amount of predictability 
being a matter to be ascertained empirically. If sufficient pre- 
dictability can be obtained from such a subuniverse alone, then 
there may be no need to investigate the other subuniverses of the 
attitude for that particular problem. 

The insufficiency of “known-group validation.”’ Is it true that the 
attitude-as-behavior approach can lead to improvement in research? 
Let us examine how it bears on two conventional techniques. One 
can be called that of known-group validation, and the other that of 
the single criterion. 

The procedure of “known-group validation” consists of beginning 
with an informal judgment of one or more groups of people with 
respect to their attitude. The behavior defining the attitude is not 
explicitly stated beforehand, but it is agreed to accept comparisons 
of the groups with respect to this undefined attitude. An illustra- 
tion might be that of “esprit de corps.” The “known-group”’ ap- 
proach for this is to say, for example, “‘We don’t know exactly what 
the term signifies, but we can agree that whatever ‘esprit de corps’ 
is, the paratroops in the Army have more of it than the Infantry. 
Therefore, let us try to find observations that will best discriminate 
between paratroopers and infantrymen.”’ 

A patent weakness of such an approach is that it cannot provide a 
definition. Many things could be found that would discriminate 
between paratroops and Infantry, but would not be called “esprit 
de corps.” As an extreme example, the shoes men wore (at least 
until late in the war) would discriminate very sharply between para- 
troops and Infantry; but a consensus of research workers would un- 
doubtedly reject shoes as part of the definition of ‘esprit de corps.”’ 


7 The processes of sampling people and of sampling items are not at all identical; ran- 
dom sampling, stratified or not, is used for the first, but is not applicable to item con- 
struction. 
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This approach cannot distinguish between a definition and its corre- 
lates. 

A less obvious weakness in this indirect technique is its implicit 
assumption that an attitude is necessarily unidimensional, that 
individuals or groups can be meaningfully ranked on it from high 
to low. If an attitude is composed of several multidimensional ele- 
ments, there is no need for each of the separate elements to dis- 
criminate between groups even though the composite does discrimi- 
nate. It is a commonplace of multiple correlation that zero-order 
relationships may have little bearing on higher order relationships. 

Such an indirect validation approach 1s not designed for defining 
concepts; it seems intended only to help in the absence of a definition, 
and what help it can give is necessarily very limited. Criterion 
groups can be useful as a stimulation to intuition in cases where it is 
not clear what the problem is. By asking, ‘‘What behavior led us 
to think of these groups as criterion groups?” research workers can 
be led to define one or more classes of acts that can be agreed on as 
being the behavior the researcher is interested in studying. From 
the point of view of attitude-as-behavior, the important thing is 
finally to be clear as to the acts that are to be considered intrinsic 
to the attitude. In so far as the criterion groups prove to clarify 
the universe of interest, they are an effective informal aid; a danger 
is to focus so much on the groups as to lose sight of the fact that 
possible discrimination between them is only a correlate of the atti- 
tude or attitudes to be studied. 

Dangers of the regression approach. A related indirect approach 
widely used is that of attempting to predict a single criterion. As 
an example, consider the notion of job satisfaction. The approach 
of the single criterion requires that the attitude of job satisfaction— 
when defined—should correlate as highly as possible with actually 
leaving or staying on the job when it is a matter of free choice (or 
some similar criterion). Things that could help predict whether or 
not a person will leave a job, given his free choice, would be as- 
sembled into a composite to predict the criterion; and the regression 
of the criterion on the composite would be called an index of job 
satisfaction. The attitude itself is not defined, but the index is 
supposed, in some sense, to represent the attitude. 

For the purpose of predicting whether or not a man will leave his 
job, the procedure of assembling a composite and coming out with 
a prediction variable is, of course, the proper procedure. It should 
not, however, be inferred that thereby a concept is necessarily de- 
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fined. How this would be an improper inference can be seen in 
several ways. 

One way is to suppose that a composite can be assembled that 
would yield a prediction variable that allows perfect prediction. 
There is a curious aspect to such a composite, namely, that it is 
never unique. If more variables are added to the composite, the 
enlarged composite will necessarily also yield perfect prediction, no 
matter what the additional variables are. This is an immediate conse- 
quence of the definition of multiple regression; adding variables can 
not reduce predictability. Furthermore, in general it should not be 
expected that there is a unique smallest subset of variables in the 
composite that will yield perfect prediction. It seems plausible that 
two or more different subsets of predictors will yield the same per- 
fect prediction, if perfect prediction is possible at all. 

Similarly, in the case where a composite yields imperfect predic- 
tion—which is the usual case in practice—there are in general sev- 
eral different subsets (or several other composites) which will yield 
equally good predictions.* Furthermore, for this case of imperfect 
prediction, there can be negligible correlations between the indexes.’ 

If it is desired to predict some external variables from the attitude, 
and if all that is available is an index which correlates but imper- 
fectly with the attitude, it is very unsatisfactory merely to correlate 
the external variable with this index. Which of the possible indexes 
should be used? Even though the indexes should correlate equally 
with the attitude, their correlations with an external variable can 
vary considerably, just as the correlations among themselves can 
vary considerably. It is possible for an index to predict the outside 
variable far better than does the attitude, as well as far worse than does 
the attitude. 

And even further, the correlation of an external variable with a 
particular index is in general less than the multiple correlation of the 
external variable on the observations from which the index was de- 
rived. The multiple regression of the attitude on the prediction 
composite is in general different from the multiple regression of an 
external variable on the same composite, so that the regression for 


8 For an analytical discussion of the lack of intrinsic meaning of multiple regressions, 
see sections 19, 20 and 21 in Louis Guttman, ‘An Outline of the Statistical Theory of 
Prediction,” in P. Horst, et al., The Prediction of Personal Adjustment (Social Science 
Research Council, New York, 1941), pp. 287-292. 

® As a numerical example from linear correlation, consider the case where each of two 
composites correlate .7 with the criterion. It can be proved that the only restriction 
on the correlation between the two indexes is that it be between —.02 and +1.00. 
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the one variable is not the most appropriate to be used for the other. 

In view of all this, it seems that defining an attitude by resorting 
to an index can lead only to ambiguous research. 

The characterization of attitude as a delimited totality of be- 
havior is intended to de-emphasize the indirect approaches which 
characterize so much of current studies. The ambiguities of indi- 
rection are almost of necessity irresolvable. A direct approach 
such as outlined here seems better designed to clarify and to struc- 
turalize research problems into a usable form, and the problems that 
attend studying attitude-as-behavior have the possibility of being 
solved. 

Internal validity and external validities. Perhaps the major reason 
why so much research has been predicated upon indirect and incom- 
plete validation procedures has been lack of recognition of the fact 
that in general there are two quite distinct kinds of validity. One 
kind we shall call ‘internal’ and the other kind “external.” 

“Internal validity” is the problem of definition. In defining a 
universe of behavior, the test of ‘internal validity” for each item 
in the universe is its content. Does the content of the item belong 
in the universe? Only a judgment of the content can answer this, 
according to our approach. For example, does the following prob- 
lem belong to a universe of knowledge of arithmetic: “How much is 
two plus two?” Or does the following item belong in a universe of 
opinion about the presidential qualities of Mr. Truman: ‘Do you 
think Truman is doing a good or a bad job as President?” If one 
wishes to study knowledge of arithmetic, then one asks arithmetic 
questions. If one wishes to study opinions about the President, 
then one asks questions about the President. What questions go 
into each of such universes is determined by the content involved. 
Each item with the proper content of the universe has “internal 
validity” for that universe. 

“External validity” is the problem of prediction. A universe has 
but one “internal validity,” but 7 has many possible “external validi- 
ties,” since it can be used for many different prediction purposes. 
Knowledge of arithmetic may have some validity for predicting 
success as an accountant, a different validity for predicting success 
as an engineer, etc., etc. Opinion of President Truman may have 
some validity in predicting who will vote for him on election day, a 
different validity for predicting who will support his program on 
price control, ete., etc. A particular “external validity” for a uni- 
verse cannot be determined except by experiment. The thing to 
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be predicted must first be defined, and then observations must be 
made to determine the correlation between the predictor and this 
particular criterion. Generally, the correlation will not be perfect. 
In the social sciences, we rarely find that one variable can be pre- 
dicted well from another. Ordinarily, a battery of predictors is 
needed to attain a substantial correlation with a criterion. To pre- 
dict success as an accountant, it should be helpful to consider not 
only knowledge of arithmetic, but other universes as well. To pre- 
dict well who will vote for a presidential candidate, other things 
besides opinion of that candidate should be taken into account. 
Hence, the many “external validities’’ of an attitude may be far 
from perfect by themselves. 

The ‘‘external validities” of an attitude cannot be studied until 
the attitude itself is defined. ‘‘Internal validity’? must be agreed 
on first. After the attitude is defined, then any particular ‘‘ex- 
ternal validities’” can be explored empirically. Some may be high 
and some may be low, but, regardless, it should be clear that ‘‘ex- 
ternal validities’” are empirical correlates of, and not integral cri- 
teria for, “internal validity.” 

Accuracy as an “external validity.”” The distinction between “‘in- 
ternal” and “external validity” can be illustrated by one kind of 
prediction problem, that of accuracy. In a poll or survey, some 
questions may be used which attempt to reproduce some external 
or objective fact, and some questions may be used to elicit opinion. 

If a man is asked to state his age, one “external validity’’ of his 
response would be the accuracy with which it corresponds with data 
recorded on his birth certificate. This particular external criterion 
on the birth certificate is defined apart from the man’s response, 
and can be observed without ever asking the man any questions. 
The man’s response is a separate variable which belongs to a uni- 
verse of all questions that could be asked of the man about his age; 
it has “internal validity” for this universe of questions because of 
its content. 

What a man says about his age is one thing, and what the birth 
certificate records is another; the relationship between the two is 
something that is to be determined empirically. But, regardless of 
this relationship, regardless of this particular “external validity’’ of 
the man’s response, the response can have high “external validity” 
for other prediction problems. Even if the response does not ac- 
curately represent birth certificate data, it is a statement of the man 
about himself, and such a statement can quite often predict other 
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things very well. A man’s statement about his age might not be 
accurate chronologically, but it might prove to be a good predictor, 
say, of his personal adjustment or other areas defined externally to 
it. Therefore, to say that a man’s statement about his age is “not 
valid” is very incomplete; what is ordinarily meant by this is that 
it has little “external validity” for one particular problem. 

In opinion questions, there is less danger of losing focus by seizing 
upon a particular external criterion, because there ordinarily is no 
supposedly factual counterpart to the opinion response which can 
be obtained independently of the respondent. To inquire into the 
“‘accuracy” of opinions does not in general make much sense, then. 
Perhaps what is referred to by such an inquiry are problems concern- 
ing the sampling of questions and of bias. How well does a par- 
ticular opinion question represent the universe from which it was 
drawn? Does it divide the population properly into pros and cons? 
Such problems are the concern of scale and intensity analysis, and 
are internal to the universe. They are not problems of “external 
validity.” 

The following chapters are devoted to the study of the internal 
structure of a universe of attitude and opinion items (or of any other 
universe of qualitative data), and show how knowledge of this struc- 
ture is essential for an analysis of such qualitative data. 


CHAPTER 3 


THE BASIS FOR SCALOGRAM ANALYSIS‘ 


NE of the fundamental problems facing research workers in the 

field of attitude and public opinion measurement is to deter- 

mine if the questions asked on a given issue have a single meaning 

for the respondents. Obviously, if a question means different things 

to different respondents, then there is no way that the respondents 

. can be ranked in order of favorableness. Questions may appear to 

express but a single thought and yet not provide the same kind of 

stimulus to different people. The responses even to the simplest 
question can differ in kind as well as in degree. 

That two people can give the same response to the same question 
and yet have different attitudes has long been of primary concern 
to public opinion pollsters. This is particularly the case when more 
than one question is asked about the same topic, and the replies to 
the different questions appear to be inconsistent. Consistency of 
response is a problem that plagued attitude research long before 
public opinion polling captured the fancy of the public. How can 
one tell if there is enough consistency in the responses of a popula- 
tion to a series of questions to indicate that only a single factor is 
being measured? Are all respondents interpreting the questions to 
mean the same things? Are differences in responses due only to 
differences in degree of feeling and not to differences in kind? Is it 
meaningful to score the people from high to low with respect to a 
given set of items? 

This problem of consistency underlies a great deal of research in 
the social and psychological sciences dealing with large classes of 
qualitative observations. For example, research in marriage is con- 
cerned with a class of qualitative behavior called marital adjustment 
which includes an indefinitely large number of interactions between 
husband and wife. Public opinion research is concerned with large 


1 By Louis Guttman. A bibliography of published articles on scalogram analysis is 
given at the end of this chapter. 
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classes of behavior like expressions of opinion by Americans about a 
military treaty with the British. Educational psychology deals 
with large classes of behavior like achievement tests. Other prob- 
lems in social and psychological research where the consistency 
underlying a series of items is of fundamental importance include 
the study of aptitudes involved in fitting people into jobs, the meas- 
urement of human intelligence and abilities, the study of neurotic 
behavior and other aspects of personality, the appraisal of social 
status—in short, any problem involving the assigning of numerical 
values to qualitative observations in an attempt to evolve a single 
rank ordering. It is often desired in such areas to be able to sum- 
mInarize data by saying, for example, that one marital couple is 
better adjusted than another marital couple, or that one person has 
a better opinion of the British than has another person, or that one 
student has a greater knowledge of arithmetic than has another stu- 
dent. 

While the data to be presented in the present chapter will deal 
almost entirely with social attitudes or opinions, the approach is ap- 
propriate for all of the types of problems mentioned. This rather. 
new approach seems to afford an adequate basis for the quantifi- 
cation of many types of qualitative data. 

The approach to scale analysis to be presented here has been used 
successfully during the war in investigating morale and other prob- 
lems in the United States Army. While some interesting mathe- 
matics lie in the background, no knowledge of this mathematics is 
required in actually analyzing data. Simple routines have been 
established which require no knowledge of statistics, which take less 
time than the different manipulations now used by various investi- 
gators (such as critical ratios, biserial correlations, factor analyses, 
etc.), and which give a complete picture of the data not afforded by 
any of these other techniques. The word “picture” might be 
interpreted here literally, for the results of the analysig are presented 
and easily assimilated in the form of a “‘scalogram,” which gives the 
configuration of the qualitative data. 


Theory of Scale Analysis 


A main condition for an attitude scale has often been pointed out 
by psychologists. For example, Murphy, Murphy, and Newcomb 
say that “no scale can really be called a scale unless one can tell 
from a given attitude that an individual will maintain every atti- 
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tude falling to the right or to the left of that point.”’? A similar 
consideration is the starting point for our theory of scale analysis. 
Instead of focusing on the ranking of items, however, we focus on 
the ranking of individuals. The ranking of items apparently is re- 
stricted to dichotomous items, where a person either endorses or 
does not endorse a statement. In such a case, it is possible to con- 
sider a ranking of endorsements, so that if a person endorses a more 
extreme statement, he should endorse all less extreme statements if 
the statements are to be considered a scale. But if the items have 
more than two categories, such a consideration breaks down; “agree” 
to one item might be equivalent to, or even less “favorable” than, 
“‘undecided’”’ to another item, so that there remains a problem of 
how to rank items and response categories. 

The ranking of people provides a more general approach to the 
problem of scaling, since it turns out to be equivalent to the ranking 
of items when all items are dichotomous, and it also includes the 
case where items have more than two answer categories. We shall 
- call a set of items of common content a scale if a person with a higher 
rank than another person is just as high or higher on every item 
than the other person. This involves no problem of ranking the 
categories of one item against those of the other items, but only 
needs a ranking of the categories within each item. 

An equivalent definition of a scale for our approach is that, within 
each item, if one response category is higher than another, then all 
people in the higher category must have higher scale ranks than 
those in the lower category. 

A third equivalent definition of a scale is the one upon which our 
practical scalogram analysis procedures are directly based. It re- 
quires that each person’s responses should be reproducible from his 
rank alone. A more technical statement of the condition is that 
each item shall be a simple function of the persons’ ranks. The 
meaning of this is expanded below, where it is easily seen that the 
three definitions of a scale just given here are all equivalent. This 
third definition, while perhaps the least intuitively obvious, has 
proved to be the most convenient formulation for practical pro- 
cedures. Of course, when it is fulfilled, the first two definitions are 
also fulfilled; and when the items are dichotomous, then the, psy- 
chologists’ condition first mentioned is also fulfilled. 


?G. Murphy, L. B. Murphy, and T. M. Newcomb, Experimental Social Psychology 
(Harper & Bros., New York, 1937), p. 897. We shall use the word “‘item’’ where they 
use “attitude.” 
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Murphy, Murphy, and Newcomb further note that, “As a matter 
of fact there is every reason to believe that none of the rather com- 
plex social attitudes which we are primarily discussing will ever con- 
form to such rigorous measurement.” * Perhaps such a belief may 
account for the fact that the mass of current attitude research pays 
little or no attention to this fundamental rationale. The common 
tendency has been to plunge into analysis of data without having a 
clear idea as to when a single dimension exists and when it does not. 
For example, bivariate techniques—like critical ratios and biserial 
correlations—are commonly used to find items that ‘‘discriminate”’ 
and to determine “weights,” without testing whether or not the 
multivariate distribution of the items is actually indicative of a single 
dimension. 

One of the main purposes of this chapter will be to propose a rigor- 
ous definition of a scale. This definition applies not only to the 
study of general attitudes, but also to the study of public opinion 
on specific issues. Furthermore, specific examples will be given to 
show that consistent scales satisfying the rigorous requirements pro- 
posed above have been obtained in actual practice. The proposed 
method provides a simple analysis of a series of questions which will 
enable one to determine quickly whether or not the basic condition for a - 
scale is satisfied by the data. 

The notions of variable, function, and simple function. First, a word 
about what is meant by a variable, whether qualitative or quanti- 
tative. We use the term in its conventional logical or mathematical 
sense, as denoting a set of values. These values may be numerical 
(quantitative) or nonnumerical (qualitative). We shall use the 
term “attribute” interchangeably with “qualitative variable.”’ The 
values of an attribute (or of a quantitative variable, too, for that 
matter) may be called its subcategories, or simply categories. 

A variable y is said to be a single-valued function of a variable z if 
to each value of x there corresponds a single value of y. 

In particular, suppose y is an attribute, say like the attribute 
about expression of liking for the British, and takes on three values. 
We may denote by y; the statement, “I like the British’’; by y2, the 
statement, ‘‘I don’t like the British’; and by ys, “I don’t know 
whether or not I like the British.” If z is a quantitative variable 
which takes on at least three values, and if we can divide the x 
values into three intervals which will have a one-to-one correspond- 


3 [bid. 


64 BASIS FOR SCALOGRAM ANALYSIS 


ence with the values of y, then we shall say the attribute y is a 
simple function of z. For example, suppose z takes on the ten 
values 0, 1, 2, 3, 4, 5, 6, 7, 8, 9. Then the correspondence table 
might be as follows: 


The three intervals for x are: 0-2, 3-4, and 5-9, to which correspond 
the values y:, ys, and y, respectively. Every person who has an 
z-value between 0 and 2 has y; as his y-value; every person wlio has 
an z-value of 3 or 4 has y; as his y-value; and every person with an 
z-value between 5 and 9 has 4, as his y-value. 

We might show this graphically by plotting the z-values on a 
straight line, and cutting it into intervals: 


"a Ya us 
0 1 2 3 4 5 6 7 8 9 
Figure 1 


For statistical variables, another representation is in terms of a bar 
chart of frequencies, and this is what is used for convenience below. 
The definition of scale. For a given population of objects, the 
multivariate frequency distribution of a universe of attributes will 
be called a scale if it is possible to derive from the distribution a 
quantitative variable with which to characterize the objects such 
that each attribute is a simple function of that quantitative variable. 
Such a quantitative variable is called a scale variable. 

Perfect scales are not to be expected in practice. The deviation 
from perfection is measured by a coefficient of reproducibility, which 
is simply the empirical relative frequency with which the values of 
the attributes do correspond to the proper intervals of a quantitative 
variable. In practice, 90 per cent perfect scales or better have been 
used as efficient approximations to perfect scales. 

A value of a scale variable will be called a scale score, or simply a 
score. The ordering of objects according to the numerical order of 
their scale scores will be called their scale order. 

Obviously, any quantitative variable that is an increasing (or de- 
creasing) function of a scale variable is also a scale variable. For 
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example, in the illustration above, consider x to be a scale variable. 
Any constant could be subtracted from or added to each of the z 
scores, and y would remain a simple function of the transformed zx. 
Thus, the scores 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 could be replaced by | 
respective scores —5, —4, —3, —2, —1, 0, 1, 2, 3, 4. Or the x 
scores could be multiplied by any constant, or their square roots or 
logarithms could be taken—any transformation, continuous or dis- 
continuous, could be used, as long as the rank order correlation be- 
tween the original z and the transformed variable remained perfect. 
All such transformations will yield scale variables, each of which is 
equally good at reproducing the attributes. 

Therefore, the problem of metric is of no particular importance . 
here for scaling. For certain problems like predicting outside vari- 
ables from the universe of attributes, it may be convenient to adopt 
a particular metric like a least squares metric, which has convenient 
properties for helping analyze multiple correlations. However, it 
must be stressed that such a choice of metric is a matter of con- 
venience; any metric will predict an outside variable as accurately 
as will any other. 

In practice, the rank order has been used as a scale variable. (It 
is in fact a least squares metric for a rectangular distribution of scale 
scores.) 

While rank order is sufficient for the mechanical aspects of testing 
for scalability and of external prediction, it is not adequate for cer- 
tain problems of psychological description and generalization. The 
equations of scale analysis in Chapter 9 provide a psychologically 
meaningful metric, beyond rank order, whose zero point is located 
by means of intensity analysis as in Chapter 7. 


An Example of a Scale of Dichotomies 


As may be expected, the universe of attributes must form a rather 
specialized configuration if it is to be scalable. Before describing a 
more general case, let us give a little example. Suppose that a 
statistics test is composed of the following problems: 

Consider a population of voters in which 60 per cent are Demo- 
crats and 40 per cent are Republicans. 

1. What is the probability that one person chosen at random will 

be a Democrat? 

2. What is the probability that two people chosen at random will 

both be Democrats? 


c 
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3. What is the probability that out of ten people chosen at ran- 

dom, at least three will be Democrats? 

If this test were given to the population of members of the Amer- 
ican Sociological Society, we would perhaps find it to form a scale 
for that population. The responses to each of these questions might 
be reported in dichotomous form as “right’’ or “wrong.” There 
are 2 X 2 X 2 = 8 possible types for three dichotomies. Actually, 
for this population of sociologists we would probably find only four 
of the eight types occurring. There would be (a) the type which 
would get all three questions right, (b) the type which would get the 
first and second questions right, (c) the type which would get only 
the first question right, and (d) the type which would get none of 
the questions right. Let us assume that this is what would actually 
happen. That is, we shall assume the other four types, such as the 
type getting the first and third questions right but the second ques- 
tion wrong, would not occur. In such a case, it is possible to assign 
to the population a set of numerical values like 3, 2, 1, 0. Each 
member of the population will have one of these values assigned to 
him. This numerical value will be called the person’s score. From 
& person’s score we would then know precisely to which problems 
he knows the answers and to which he does not know the answers. 
Thus a score of 2 does not mean simply that the person got two 
questions right, but that he got two particular questions right, 
namely, the first and second. A person’s behavior on the problems 
is reproducible from his score. More specifically, each question is a 
simple function of the score, as will be shown below. 

The meaning of ‘‘more”’ and “‘less.”” Notice that there is a very 
definite meaning to saying that one person knows “‘more”’ statistics 
than another with respect to this sample. For example, a score of 
3 means more than a score of 2 because the person with a score of 3 
knows everything a person with a score of 2 does, and more. 

There is also a definite meaning to saying that getting a question 
right indicates ‘‘more’’ knowledge than getting the same question 
wrong, the importance of which may not be too obvious. People 
who get a question right all have higher scale scores than do people 
who get the question wrong. Asa matter of fact, we need no knowl- 
edge of which is a “right” answer and which is a “‘wrong’’ answer 
beforehand to establish a proper order among the individuals. For 
convenience, suppose the questions were given in a “‘true-false”’ 
form, with suggested answers (1) .50, (2) .36, (3) .42 for the respec- 
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tive questions.‘ Each person records either a 7 or an F after each 
question, according as he believes the suggested answers to be True 
or False. If the responses of the population form a scale, then we 
do not have to know which are the correct answers in order to rank 
the respondents (only we will not know whether we are ranking 
them from high to low or from low to high). By the scale analysis, 
which essentially is based on sorting out the joint occurrences of the 
three items simultaneously, we would find only four types of persons 
occurring. One type would be F,T.F;, where the subscripts indi- 
cate the questions; that is, this type says F to question 1, 7 to 
question 2, and F to question 3. The other three types would be 
F,T,T;, F\F;T;, and T,F;T;. These types can be shown in a chart 
(a “scalogram”’) where there is one row for each type of person and 
one column for each answer category of each question. 

The scale analysis would establish an order among the rows and 
among the columns which would finally look like this: 


Question 
Type score PF, Ts P, T; Fy T; 
3 z z z 
2 z z z 
1 z z z 
0 z z = 
Figure 2 


Or, alternatively, both rows and columns might be completely re- 
versed in order. Each response to & question is indicated by an z. 
Each row has three marks because each question is answered (either 
correctly or incorrectly). The parallelogram pattern in the chart® 
is necessary and sufficient for a set of dichotomous attributes to be 
expressible as simple functions of a single quantitative variable. 
From this chart we can deduce that F, T:, and F; are all correct 
answers, or are all incorrect answers. That is, if we were now told 
that F, is a correct answer, we would immediately know that T; 
and F, are also correct answers. This means that we can order the 
men according to their knowledge even if we do not know which are 
“We shall assume that no one gets an answer right by guessing. Scale analysis can 


actually help one pick out responses that were correct merely by guessing from an 
analysis of the pattern of errors. But for this, much more than three items are neces- 


sary. 

5 Such a chart, where one column is used for each category of each attribute, we call a 
scalogram. The scalogram boards used in practical procedures are simply devices for 
shifting rows and columns to find a scale pattern, if it exists, as will be explained in de- 
tail in the next chapter. 
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the correct answers and which are the incorrect answers, only we do 
not know whether we are ordering them from highest to lowest or 
from lowest to highest. Except for direction, the ordering is a 
purely formal consequence of the configuration of the behavior of 
the population with respect to the items. The importance of this 
fact will become more apparent in more complicated cases where the 
attributes are not dichotomous but have more than two categories. 
As will be shown later, the scale analysis automatically decides, for 

‘example, where an “undecided” response to a public opinion poll 
question belongs, whether it is above ‘‘yes,”’ below ‘‘no,” in between, 
equivalent to “yes,” or equivalent to “no.” A priori judgments of 
content order are not essential to scale analysis. 
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The bar chart representation. Another way of picturing the dichot- 
omous scale of the sample of three items would be as follows: sup- 
pose that 80 per cent of the population got the first question right, 
40 per cent got the second question right, and 10 per cent got the 
third question right. The univariate distributions of the three re- 
spective items could be shown by the bar charts in Figure 3. 

The bars show the percentage distributions for the respective 
questions. The multivariate distribution for the three questions, 
given that they form a scale for the population, can also be indicated 
on the same chart, since all those who are included in the group get- 
ting a harder question right are also included in the group getting 
an easier question right. Thus, we could draw the bar chart over 
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again, but connect the bars with dashed lines in the fashion shown 
in Figure 4. 

Here again we can see how the three questions are simple functions 
of the scores. From the marginal frequencies of the separate items, 
together with the fact that the items form a scale, we are enabled to 
deduce that 10 per cent of the people got a score of 3. The 10 per 
cent who got the hardest question right are included in those who 
got the easier questions right. This is indicated by the dashed line 
on the right, between the scores 2 and 3, which carries the same 10 
per cent of the people (those with a score of 3) through the three 


QUESTION FREQUENCY SCORE 


PERCENTAGE 
DISTRIBUTION 
OF REPLIES 


bars. The 40 per cent who got the second question right include 
the 10 per cent who got the hardest question right and 30 per cent 
out of those who got the hardest question wrong, but all 40 per cent 
got the easiest question right. This leaves us 30 per cent who got 
just the first and second questions right. And so on. Thus we 
can think of an ordering of the persons along a vertical continuum, 
and each dichotomy can be thought of as resulting in one additional 
cut on that continuum. All those above the cutting point get the 
question right, and all those below the cutting point get the ques- 
tion wrong. Thus, there is a one-to-one correspondence between 
the categories of an item and segments of the continuum. Or we 
can say that each atiribute is a simple function of the rank order along * 
the continuum. 

If the “right” and “wrong’’ answer categories are separated, as in 
Figure 2, this bar chart representation assumes the pattern of a 
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parallelogram. Two basic steps are involved in this procedure, as 
will be described in detail in the next chapter. First, the questions 
are ranked in order of ‘‘difficulty”’ with the ‘‘hardest’’ question, i.e., 
the one that fewest persons got right, placed first and with the other 
questions following in decreasing order of ‘‘difficulty.”” Second, the 
people are ranked in order of “knowledge” with the “most informed”’ 
persons, i.e., those who got all questions right, placed first, the other 
individuals following in decreasing order of ‘‘knowledge.”’ These 
two steps are the basic procedure for the scalogram board technique 
of scale analysis. The resulting parallelogram, assuming a scale is 
present, is shown in Figure 5. 


SCORE FREQUENCY CATEGORY ORDER 


Question “Right” Question ” Wrong” 
(3) (2) (3) (2) mM 


3 10 % 
2 30 % 
| 40 % 
fe] 20 % 


Figure 5 


Zero-order correlations between items. It is because all the items 
in the sample can be expressed as simple functions of the same or- 
dering of persons that they form a scale. Each item is perfectly 
correlated with or reproducible from the ordering along the con- 
tinuum. However, the point correlations between the items are 
not at all perfect. For example, the fourfold table between the 
second and third items is as follows: 


Question 2 


Right Wrong 


Right 10 0 10 
Question 3 |__| 
Wrong 30 60 90 


40 60 100 
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The point correlation between the two items is .41. As a matter of 
fact, the point correlation between two dichotomous items may be 
anything from practically zero to unity, and yet they may both be 
perfect simple functions of the same quantitative variable.® 

An important feature of this fourfold table is the zero frequency 
on the upper right hand corner cell. Nobody who got the third 
question right got the second question wrong. Such a zero cell 
must always occur in a fourfold table between two dichotomous 
items which are simple functions of the same quantitative variable. 
This zero cell, furthermore, must occur in the column or row which 
contains the lowest frequency and in that cell which represents a 
“positive” answer on one question and a “negative” answer on the 
other question. Given only the marginal distributions of any two 
scale questions, it is possible to compute the frequencies in each cell 
of a correlation table of the two questions. In fact, if 1s possible to 
construct the multivariate distribution of all the scale questions from a 
knowledge of their straight distributions alone.’ This can be done 
regardless of how many answer categories are retained for each 
question. An example of a scale using trichotomous items will be 
given later. 

This requirement concerning zero-order correlation tables be- 
tween items suggests a procedure for scale analysis based on these 
tables. However, zero-order relationships do not tell the whole 
story about the entire multivariate distribution. Actually, it is 
simpler to study the complete distribution by means of the scalo- 
gram board technique, or related techniques, than to study all the 
bivariate tables. It is helpful, however, to learn what a scale pat- 
tern signifies by exploring some of the consequences in terms of bi- 
variate tables. While the zero-order correlation tables are not good 
to use as a practical technique for testing items for scalability, they 
are a good pedagogical device for understanding scales. 

Zero cells for dichotomies. The presence of a zero cell in the proper 
place is a necessary but not sufficient condition for the existence of a 


* A tetrachoric coefficient for the fourfold table above, assuming a bivariate normal 
distribution, would be unity. However, this is not the correlation between the items. 
It does not tell how well one can predict one item from the other. The tetrachoric co- 
efficient expresses instead the correlation between two quantitative variables of which 
the items are functions, provided the assumptions of normality are true. The reason 
the tetrachoric is unity in this case is that the quantitative variables of which the items 
are functions are one and the same variable, namely, the scale variable. Notice, how- 
ever, that the distribution of the scale variable according to the rank order is not at all 
normal. One of the contributions of scaling theory is to do away with untested and 
unnecessary hypotheses about normal distributions. 

7 The importance of this predictability for correlation analysis of attitude questions 
will be discussed later. 


72 BASIS FOR SCALOGRAM ANALYSIS 


scale. If a correlation table of two attitude questions does not re- 
veal this zero cell, then one can be certain that these two questions 
are not members of a single attitude continuum, i.e., do not have 
but one meaning to the respondents. However, the existence of a 
zero cell in itself is insufficient proof of a single variable, even if the 
cell is in the proper position. First, the two questions must be 
judged to belong to the same content universe by virtue of their 
content. Second, a sample of two questions is too subject to sam- 
pling error (with respect to the universe of content) to provide an 
adequate test; the occurrence of the zero cells in their proper position 
should be found in the multivariate distribution of several questions. 
The desired pattern for more than two questions will become more 
clear to the reader as additional examples are given. 

Let us translate this simple example into a problem of public opin- 
ion analysis. Suppose the issue to be studied is public opinion 
toward the continued maintenance of a large Army now that the 
war is won. Three questions dealing with this topic, together with 
hypothetical percentage distributions of replies, are given below. 


1. In your opinion, is it necessary or unnecessary for the protection of the 
United States to have a strong Army? 


(a) It is necessary 70% 
(b) It is not necessary 30 
2. Do you think the United States should or should not increase the 
present size of the Army? 
(a) It should 50% 
(b) It should not 50 


3. If all other countries agree to disarm, do you think the United States 
should or should not maintain a large Army? 
(a) It should 20% 
(b) It should not “80 


For the present, we will not deal with the problem of interpreting 
which of these percentages—20 per cent, 50 per cent or 70 per cent 
—represents the division of public opinion upon the issue of the size 
of the Army. This problem will be treated in detail in Chapter 7, 
“The Intensity Component in Attitude and Opinion Research.” 

Following the procedure outlined in relation to the statistical 
knowledge example given previously, we can test the hypothesis 
that these three questions form a scale by seeing whether the three 
questions are simple functions of the scale scores. One way of do- 
ing{this would be to draw the bar chart diagram shown in Figure 5 
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and to compare the obtained frequency of the four scale types with 
the theoretical or expected frequency. 

To draw this diagram of expected frequencies, we first place the 
questions in descending order from the question calling for the most 
extreme expression of ‘“‘favorableness” toward a large Army, i.e., 
that question to which fewest persons reply ‘in favor of” a large 
Army, to the question calling for the least extreme expression of 
“‘favorableness” toward a large Army, i.e., that question to which 
the largest number of persons reply “in favor of’’ a large Army.® 


SCORE EXPECTED QUESTION ORDER 
FREQUENCY 


Per cent “ Unfavorable * 
(3) (2) (1) 


Should not 
80 % 


Should not 
50 % 


Unnecessary 
30 % 


This ordering of the questions shows us immediately how many 
scale types should occur and what their expected frequencies should 
be, if we have a scale. Scoring each of the subjects and placing 
them in rank order of “‘favorableness” toward a large Army, i.e., 
the number of questions upon which they express opinions “‘in 
favor of” a large Army, should produce the parallelogram shown in 
Figure 6. 

Thus, in order for these three questions to form a scale, all persons 
who felt that the United States should maintain a large Army even 
if all other countries disarm must also feel that the United States 
should increase the present size of the Army and that it was neces- 


8 The terms “favorable” and “unfavorable” are used in the sense of more or less 
favorable only. To divide such a continuum into “favorable” and “unfavorable” calls 
for the determination of a zero point—a problem which will be discussed in a later chap- 
ter. 
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sary for the protection of the United States to have a strong Army. 
Examination of the fourfold table between any two of the ques- 
tions would have to reveal a zero cell in the corner representing the 
“favorable’”-“unfavorable’”’ cell of the lowest marginal frequency. 
For example, the frequencies of the cells in the cross tabulation be- 
tween questions 2 and 3 are completely predictable from the mar- 
ginals, as follows: 


Question 2 
“Favor- ‘Unfavor- 
able” able” 
“Favorable” 20 0 20 
Question 8 —————“ 
“Unfavorable” 30 50 80 
50 50 100 


To conform to the scale pattern, there should be no respondents 
who feel that the United States should maintain a large Army even 
if all other countries disarm, but who think that the United States 
should not increase the present size of the Army. On the other 
hand, a respondent who feels that the United States should not 
maintain a large Army if all other countries disarm, may or may 
not favor an increase in the present size of the Army. 

Given a scale pattern, the same predictability or reproducibility 
of intercorrelations between any questions is possible from a knowl- 
edge of the straight distributions. In addition, a knowledge of the 
scale score enables one to predict or reproduce the responses of any 
individual to each of the questions asked. The main condition for 
' a scale is satisfied: a person with a higher rank than another is just 
as high or higher on each item. 

Example of a scale of trichotomies. The same technique of scale 
analysis outlined for dichotomous items applies equally well to items 
with any number of answer categories. In fact, this technique 
enables one to determine whether the different answer categories 
should be treated separately as representing meaningfully different 
replies or whether they should be combined. For example, in re- 
sponse to a question, ‘““How important is it for the United States to 
have a large Army?’”’ how should the answer categories “fairly im- 
portant” and ‘“‘not so important” be treated? Should they be left 
as separate categories, or should they be combined into a kind of 
neutral category, or should ‘fairly important” be combined with 
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“very important” while “not so important” is combined with “not 
at all important?” Scale analysis, using the technique outlined, 
will tell one quite automatically how these combinations should be 
made. This will be shown in several examples to be presented in a 
following chapter. 

Let us see what a scale composed of trichotomous items would 
look like. Suppose three questions from the same attitude universe 
were asked, each of which had three answer categories, the answers 
to which distributed as follows: 


Question 
Answer categories 1 2 3 
a 25% 20% 40% 
b 20 60 30 
¢ 55 20 30 


100% 100% 100% 


The number of scale types for these three trichotomous items is 
seven, whereas the number of possible types for a nonscalable area 
is twenty-seven. This reduction in scale types is indicative of the 
highly restrictive nature of the scale pattern. (The number of 
scale types equals the sum of all item categories, less the number of 
questions, plus one, while the number of all possible types equals the 
product of all the item categories.) The characteristics of the scale 
types can be determined simply by joining the answer categories of 
the different questions, as below in Figure 7. 
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Arranging the questions in order of the frequency of ‘‘favorable”’ 
responses (for all three answer categories) and arranging the sub- 
jects in order of scale scores (assume simple weights of two for a, 
one for b, and zero for c), the following bar chart parallelogram 
emerges for the ideal scale pattern (Figure 8). 


SCALE RESPONSE IDEAL CATEGORY ORDER 
SCORE COMBINATION FREQUENCY 
Per cent | .Per cent, Per cent 
“Favorable "Neutrol” "Unfavorable" 
20 le 36 Ib 2b 3b Ie 3c 02 


5 oob 5% pee a fs 
4 obb 15 % Ly | | 
3 bbb 5% ae foe des a xe 
lp | [ 
2 bbe 25 % Cry | 
| 
2 | | 
fot 
( bee 10 % | ' cl | 


Figure 8 


This pattern is completely predictable on the basis of the marginal 
distribution of answers to each of the questions. From a person’s 
scale score it is possible to reproduce exactly how he answered each 
of the questions. From the marginal distribution of responses to 
any of the questions it is possible to construct any of the internal 
correlations between questions. The pattern of intercorrelation for 
questions 1 and 3, for example, would have to be as follows: 


Question 1 
a b c 
a 
Question 3 b 
c 


The basic condition to be satisfied is that persons who answer a 
question ‘‘favorably” all have higher scale scores than persons who 
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answer the same question “unfavorably.” This constitutes a rigor- 
ous definition of a scale. It provides a simple, objective technique | 
for testing the existence of a single variable, that is, for determining 
whether the questions have the same meaning for all respondents. 


The Measurement of Error*® 


The amount by which a scale deviates from the ideal scale pattern 
is measured by a coefficient of reproducitility. This coefficient is 
simply a measure of the relative degree with which the obtained 
multivariate distribution corresponds to the expected multivariate 
distribution of a perfect scale. It is secured by counting up the 
number of responses which would have been predicted wrongly for 

-each person on the basis of his scale score, dividing these errors by 
the total number of responses and subtracting the resulting fraction 
from 1. As will be seen in the actual examples to be presented, the 
occurrence of errors is easily determined by visual inspection of the 
scale pattern. An acceptable approximation to a perfect scale has 
been arbitrarily set at 90 per cent reproducibility. Thus, if a scale 
consisted of five items tested on 100 people, the total number of re- 
sponses would be 5 X 100 = 500. To secure a coefficient of re- 
producibility of at least .90, there could be at most 50 errors for the 
entire sample of 100 respondents on all five questions. 

The coefficient of reproducibility of the universe can be observed 
only with sampling error if the scale pattern is not perfect. There 
may be error due to the sampling of items and error due to the sam- 
pling of people. The problem of item sampling is considered later. 
With respect to sampling deviations due to people, the following 
remarks may be made. 

Reproducibility is computed by counting up the errors for each 
person on each item. If the sample consists of 100 persons and five 
items, the per cent reproducibility is based on 500 observations. If 
the errors of reproducibility are random, and if the population re- 
producibility is at least .90, then the standard error of a sample 
proportion of reproducibility is at most .013, which allows a devi- 
ation in the proportion of at most .040 at the three standard error 
level of confidence. 

Such a calculation of a standard error is, of course, voided if the 
errors are not random, which may often be the case. Empirical 
experiments on samples of 100 cases each on five items, however, 


*For a more detailed discussion of this problem see Chapter 8, ‘‘Problems of Reli- 
ability.” 
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have consistently shown less than a variation of .04 when the com- 
bined sample reproducibility has been over .90. Occasionally an 
item has behaved a little differently for comparable samples of 
people. Raising requirements for reproducibility, and using items 
with more than two answer categories, reduces the possibility for 
such minor variations. 

In some cases, a sample of 200 or more people may be necessary 
to get a clear picture of the situation. 

Reproducibility is not a sufficient criterion for scalability. Repro- 
ducibility by itself is not a sufficient test of scalability. It is the 
principal test, but there are at least four other features that should 
be taken into account: (a) range of marginal distributions, (b) pat- 
tern of errors, (¢c) number of items in the scale, (d) number of re- 
sponse categories in each item. 

(a) Range of marginal distributions: The reproducibility of any 
individual item can never be less than the percentage of respondents 
falling into a single answer category of that item, regardless of 
whether or not a scale exists. For example, if a dichotomous item 
has 80 per cent of the people in one category and 20 per cent in the 
other, there cannot be less than 80 per cent reproducibility in repro- 
ducing that item from a rank order obtained from all the items, 
regardless of the scalability of the set of items as a whole. Thus, 
if a sample comprises only items with extreme kinds of dichotomi- 
zations, reproducibility will be automatically high for that sample, 
regardless of the scalability of the universe. Therefore, to test a 
universe for scalability, attempts should be made to include in the 
sample as wide a range of marginal distributions as possible, and 
specifically to attempt to include items with marginals around 50-50. 

(b) Pattern of errors: If an area is scalable with but 10 per cent 
error (and not artificially so because of extreme marginals), this 
implies that there is but one dominant variable in the area along 
which to order the persons. The errors of reproducibility may be 
caused either (a) by one or two other variables of lesser magnitude 
that may be in the area, or (b) by many small variables. 

The existence of one or two additional small variables as opposed 
to many small variables in the area is indicated by nonscale types 
in the scale pattern which occur with sufficient frequency to be 
noticed, but not with enough frequency to impair substantially the 
reproducibility of the area from only the dominant variable. If 
such definite nonscale types exist, then the multiple correlation of 
an outside variable with the whole area would not be quite equiv- 
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alent to the simple correlation with rank order on the dominant vari- 
able, and would be attained only by taking the nonscale types into 
account. 

On the other hand, if error of reproducibility is random, then the 
multiple correlation of any outside variable on the area will be pre- 
cisely equal to the simple correlation with the rank order on the area. 
This property, it is important to note, holds no matter how low the 
reproducibility is. Some areas which are not scalable are called 
quasi scales; their reproducibility may not be high but their errors 
occur in a sort of gradient. This means that although they lack 
the essential property of a scale—rank order cannot reproduce per- 
sons’ characteristics on the items in the area very well—neverthe- 
less the rank order is perfectly efficient for relating any outside 
variable to the area. 

The difference between random errors, nonscale errors, and gradi- 
ent errors will be discussed in Chapter 5, “The Utility of Scalogram 
Analysis.” 

(c) Number of items: The more items included in a scale, the 
greater is the assurance that the entire universe of which these items 
are a sample is scalable. If the items are dichotomous (or dichot- 
omized from more than two categories as a result of the scale 
analysis), it is probably desirable that at least ten items be used, 
with perhaps a lesser number being satisfactory if the marginal fre- 
quencies of several items are in the range of 30 per cent to 70 per 
cent. Just four or five items, with marginal frequencies outside 
such a range, would not give much assurance as to how scalable the 
universe was, no matter how scalable the sample might be. In 
practice, ten or more items can be used on a pretest to determine 
whether or not a universe is scalable but fewer items can be used in 
the larger study—if the universe is shown to be scalable by the pre- 
test—to obtain the number of ranks necessary for the amount of 
discrimination between people required by the study. 

(d) Number of response categories: The more response categories 
for items included in a scale, the greater is the assurance that the 
entire universe is scalable. A caution to be observed in combining 
response categories to reduce error is to make sure that the reduc- 
tion in error is not just a consequence of obtaining new extreme 
marginal frequencies (e.g., 90-10) that do not permit much error 
(see a above). Equally important is the fact that keeping answer 
categories separate, while it will usually increase the amount of 
error, decreases the possibility of a scale pattern appearing if in fact 
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the universe is nonscalable. For example, four dichotomous items 
with high reproducibility do not provide as dependable an inference 
concerning the scalability of an area as would four trichotomous 
items which were equally as reproducible. It is especially im- 
portant to keep as many response categories as possible when the 
total number of items is small. The more categories that can 
remain uncombined, the more reliable is the inference that the 
universe from which they come is scalable. 

In many cases, of course, sufficient reproducibility may exist with- 
out combining categories. In such a case categories may be com- 
bined anyhow for convenience in final scoring. Combining cate- 
gories in such a case does not disturb rank order except that two 
adjacent ranks are merged. 


The Universe of Attributes! 


A basic concept of the theory of scales is that of the universe of 
attributes. In social research, a universe is usually a large class of 
behavior such as described in the introduction. The universe is the 
concept whose scalability is being investigated, like marital adjust- 
ment, opinion of British fighting ability, knowledge of arithmetic, 
etc. The universe consists of all the attributes that define the con-! 
cept. Another way of describing the universe is to say it consists ' 
of all the attributes of interest to the investigation which have a 
common content, so that they are classified under a single heading 
which indicates that content. 

An important consideration of the present theory of scales be- 
comes that of the sampling of items. In studying any attitude or 
opinion, there is an unlimited number of questions or question word- 
ings which could be used. Any question asked in an attitude or 
opinion survey is ordinarily but a single sample of indefinitely many 
ways the question could be put. It is well known that changing the 
wording of the questions, changing the order of presentation of ques- 
tions, changing order of check lists of answers, etc., can yield ap- 
parently different results in the responses. Conceivably, one could 
ask questions which would secure “favorable” replies ranging from 
0 to 100 per cent, depending upon the extremeness of the statement 
that the respondents are asked to approve or disapprove. It is, 

% The words population and universe are ordinarily used interchangeably in atatistical 
literature. For scales, it is necessary to refer both to a complete set of objects and to a 
complete set of attributes, so it will be convenient to reserve population for the former, 
and universe for the latter. In social research, the objects are usually people, so that 
population is appropriate for them. 
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therefore, essential to inquire into the nature of the universe of all 
possible questions of the same content, and to determine what infer- 
ences can be made about that universe that will not depend on the 
particular sample of questions used. 

Scalogram theory shows that if the universe contains but a single 
variable, that is, if all questions have but a single content ordering, 
then the same rank order of the individuals upon this content will 
be obtained regardless of which sample of questions is selected from 
the universe. The problem of sampling of items thus has a simple 
solution for the case of a scalable universe. 

An important property of a scalable universe is that the ordering 
of persons based on a sample of items will be essentially the same as 
that based on the universe. If the universe is scalable, the addition 
of further items merely breaks up each type given by the sample 
into more differentiated types. But it would not interchange the 
order of the types already in the sample. For example, in Figure 8 
above, type 5 would always have a higher rank order than type 4. 
People in type 5 might be ordered within the type into more sub- 
categories; people within type 4 might be ordered into more sub- 
categories; but all subcategories within 5 would remain of higher 
rank than all those in type 4. This may be seen in reverse, for ex- 
ample, by deleting one of the questions or by combining answer 
categories so as to make a trichotomous question dichotomous, and 
noticing that all that is accomplished is to collapse the number of 
types to a smaller number so that two neighboring types may now 
become indistinguishable; but any types two steps apart would still 
remain in the same order with respect to each other. 

Hence, we are assured that if a person ranks higher than another 
person in a sample of items, he will rank higher in the universe of 
items. This is an important property of scales, that from a sample 
of attributes we can draw inferences about the universe of attributes. 

One of the criteria for selecting a sample of items is to choose a 
sample with enough categories to provide a desired amount of dif- 
ferentiation between individuals." Thus if individuals are desired 
to be differentiated, say, into only ten groups, items should be 
chosen which will yield ten types.'"* The shape of the distribution 


11 The number of possible scale types may be determined as follows: add unity to the 
total number of categories in all questions (after combination) and subtract the number 
of questions. 

13 We are of course not considering problems of reliability in the sense of repeated 
observations of the same attributes. For convenience, we are tacitly assuming perfect 
reliability. See Chapter 8, ‘Problems of Reliability.” 
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of the rank orders in a sample of items will of course depend upon 
the marginal frequencies of the items selected. One sample of 
items may yield a distribution of one shape; another sample may 
yield a completely different shape. This need not be a matter of 
concern, since our primary interest lies in the ordering of people, not 
the relative frequency of each position. 

It might be asked how one can know the universe forms a scale if 
all one knows is a sample from the universe. 

At present it seems quite clear that in general the probability of 
finding a sample of items to form a scale by chance for a sample of 
individuals is quite negligible, even if there are as few as six dichot- 
omous items in the sample and as few as one hundred individuals." 
It seems quite safe to infer in general that if a sample of items is se- 
lected without knowledge of their empirical interrelationships and 
is found to form a scale for any sizable random sample of individuals, 
then the universe from which the items are selected is scalable for 
the entire population of individuals. This problem has already 
been discussed under “Measurement of Error.” 

The relativity of scales. A universe may form a scale for a popula- 
tion at a given time and may not at a later time. Such a change in 
time would tend to indicate that the change is one of kind, rather 
than degree. A new meaning has been added to the previous single 
variable. For example, the items in a scale of expression of desire 
of American soldiers to go back to school after the war may not 
prove to be scalable if they were asked once more at the close of the 
war. 

Conversely, a universe may not be scalable at one time, but scal- 
able at another. This would indicate a change in the structurali- 
zation of the attitude from many dimensions to one dimension, or 
that an “unstructured” attitude has become “structured.” 

A universe may form a scale for one population of individuals, but 
not for another. Or, the items may form scales for two populations 
in different manners. For example, a sample of items of satisfac- 


"To work out the complete probability theory would require two things: first, a 
definition of a sampling process for selecting items, and, second, a definition of what is 
meant by a scale not existing. A definition of the sampling process is difficult. because 
items are ordinarily developed intuitively. Stating a null hypothesis that a scale does 
not exist leads to many possible analytical formulations, for different limiting condi- 
tions may be imposed upon the multivariate distribution of the items. For example, 
should the marginal frequencies be considered fixed in all samples, should the bivariate 
frequencies be considered fixed, etc.? These are questions which may become clearer 
as the theory of scaling develops, and in return may clarify our conceptions of what ob- 
servation of social phenomena implies. 
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tion with Army life which formed a scale for combat outfits in the 
Air Force did not form a scale for men in the technical schools of the 
Air Force. The structure of camp life for these two groups was too 
different for the same items to have the same meaning in both situ- 
ations. 

A universe may not form a scale for the total population, but still 
form a scale for subgroups of that population. The essential defi- 
nition of a scale is that of ‘‘single-meaning,” and while a series of 
questions may contain different meanings to a cross section of the 
population, they may contain only a single meaning for some sub- 
group of that population. However if a scale is obtained for a cross 
section of the population, then that same scale pattern necessarily 
holds for all major subgroups. 

If a universe is scalable for one population but not for another 
population, we cannot compare the two populations in degree and 
say that one is higher or lower on the average than another with re- 
spect to the universe. They differ in more than one dimension, or 
in kind of attitude rather than in degree of “favorableness” on the 
same attitude. It is only if two groups or two individuals fall into 
the same scale that they can be ordered from higher to lower. A 
similar consideration holds for comparisons in time. An important 
contribution of the present theory of scaling is to bring out this em- 
phasis quite sharply. 

Content alone defines the universe. Before the structure of a uni- 
verse can be analyzed, the universe itself must be defined. Let us 
take an example from opinion research where it is desired to observe 
the population of individuals in a standardized manner by a check 
list of questions. The behavior of interest to the investigation is 
responses of individuals to such questions. Suppose the universe 
of items consists of all possible questions which could be asked in 
such a list concerning the fighting ability of the British. Such 
questions might be: ““Do you think the British Army is as tough as 
the German Army?” “Do you think the RAF is superior to the 
Luftwaffe?” etc. (We do not pause here for problems of wording, 
interpretation, and the like. The reader is urged rather to focus on 
the general outline we are trying to establish.) There may be an 
indefinitely large number of such questions which belong in the uni- 
verse; and in a particular investigation, ordinarily only a sample of 
the universe is used. 

An attribute belongs to the universe by virtue of its content. 
The investigator indicates the content of interest by the title he 
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chooses for the universe, and all attributes with that content belong 
in the universe. There will, of course, arise borderline cases in prac- 
tice where it will be hard to decide whether or not an item belongs 
in the universe. The evaluation of the content thus far remains a 
matter that may be decided by consensus of judges or by some 
other means. It may well be that the formal analysis for scalability 
may help clarify uncertain areas of content. However, we have 
found it most useful at present to utilize informal experience and 
consensus to the fullest extent in defining the universe. 

It should be pomted out that an area of qualitative data which 
has been carefully thought through and judged to comprise a homo- 
geneous universe of content does not necessarily form a scale. The 
concepts of universe and of scale are distinct and separate. If a 
universe is not a scale, it cannot be represented by a single rank or- 
der. In some cases, scale analysis may suggest that there are two 
or more subareas in the universe which might be scalable separately. 
Then the universe could be represented by several scale variables, 
by giving each person a rank order on each of the scalable subareas. 
It may happen that a sample of items is analyzed for a group of 
people and is not found to be scalable; but one or more subsets of 
the items seem to be scalable separately. Finding scalable subsets 
of items may sometimes imply that the original universe of content 
can be divided into subuniverses, at least one—or perhaps all—of 
which are scalable separately. To test the hypothesis that a scal- 
able subset is part of a scalable subuniverse, it is necessary to show 
that the content of this subuniverse is ascertainable by inspection, 
and is distinguishable by inspection from that of the rest of the uni- 
verse. A practical procedure to test this hypothesis might be as 
follows: construct new items of two types of content, one type 
which should belong in the original universe but should not belong 
to the scalable subuniverse. If the new items designed for the ap- 
parently scalable subuniverse do scale, and scale together with the 
old subset; and if the new items designed not to be in this sub- 
universe do not scale with the subscale; then the hypothesis is sus- 
tained that a subuniverse has been defined and has been found 
scalable. Each hypothesized subuniverse might be tested this 
way. 

It often happens, when about a dozen questions in an area are 
pretested, that all but one or two are found to be scalable. There 
are at least two possibilities as to why some items do not scale while 
others in the same area do: (a) the universe is not scalable as a whole, 
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but contains a scalable subuniverse; (b) the items have been im- 
perfectly constructed. This latter reason is so easy and glib that 
it is best to avoid it as much as possible. If the vast majority of a 
sample of items do scale, then it may be plausible to blame faulty 
construction for the nonscalability of one or two items. This 
hypothesis can be tested by rebuilding the apparently faulty items 
and retesting them. In practice, if the vast majority of a sample 
of items do scale, the one or two items that do not may often be ig- 
nored. If their construction is faulty, there are sufficient items 
without them to establish rank order for the people; and if they 
really represent a nonscalable part of the original universe, this part 
may be so small as not to be essential to the study. 

Scale analysis does not define content. The question may be asked: 
to what extent, if any, can scale analysis be used to arbitrate dif- 
ferences of opinion with respect to content, that is, as to whether or 
not a given item belongs in an area? The answer is quite simple. 
Scale analysis as such gives no judgment on content;™ it presumes 
that the universe of content is already defined, and merely tests whether 
or not the area is representable by a single variable. It might serve 
as an auciliary argument with respect to content in the special case 
where there is controversy over but one or two items of a large 
sample of items in which the remaining items are scalable. In such 
a case, if the items in doubt do scale with the others, this may be 
taken as additional evidence supporting the contention that they 
belong in the area. It should be emphasized that this kind of 
inference is but auxiliary—there must be a cogent initial argument 
based on content if an item is to be classified in an area. Sheer 
scalability is not sufficient; an item may happen to scale with an 
area, and yet not have the content defining the area—it may be a 
correlate rather than part of the definition. 

An important emphasis of our present approach is that a criterion 
for an attribute to belong in the universe is not the magnitude of the 
correlations of that item with other attributes known to belong in 
the universe. Attributes of the same type of content may have 
any size of intercorrelations, varying from practically zero to 
unity.» 


46 More generally, no correlation analysis as such determines content; it studies only 
formal relationships. If x relates perfectly with y, that does not mean z is identical 
with y. If it is known that the correlation between z and y is .6, that alone does not 
help to name the content of either z or y. 

4% That correlations are no criterion for content has been quite well known. See, for 
example, R. F. Sletto, Construction of Personality Scales by the Criterion of Internal Con- 
sistency (Sociological ‘Presa, Hanover, N.H., 1937). 
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Methods of observation. Let us assume that somehow we have a 
universe of attributes and a population of individuals defined. 
Next, observations are made as to the behavior of the population 
with respect to the universe. (In practice this will often be done 
only with samples. A sample of individuals from the population 
will have their behavior observed on a sample of attributes from 
the universe.) How the observations are to be made is of no theo- 
retical concern here. In opinion research and other fields, ques- 
tionnaires and schedules have been used. But any technique of 
observation which yields the data of interest to the investigation 
may be used. Such techniques for the social and psychological 
sciences might be case histories, interviews, introspection, and any 
other technique from which observations may be recorded. The 
important thing is not how the observations were obtained, but 
that the observations be of central interest to the investigation. 

Use of a questionnaire implies that the investigator is interested 
in a certain type of universe of verbal behavior. Participant obser- 
vation may imply that the investigator is interested in a certain 
type of nonverbal behavior. Such distinct universes may each be 
investigated separately. It may often be of interest to see how 
well one universe correlates with another, but such a correlation 
cannot be investigated until each universe is defined and observed 
in its own right. 

A good deal of attention has been given by various research work- 
ers to the types of observations recorded. Thurstone has ap- 
proached the problem of attitude measurement by asking essen- 
tially, “What are your opinions?” #* Other research workers have 
attempted to get at attitudes by asking different questions. For 
example, Bogardus asks, ‘‘What social relations are you willing to 
tolerate with these people?’’,”” while Pace asks, ““What would you 
do under these circumstances?” * Ford studied an attitude area 
by asking about past behavior and concludes, “One may say 
that the scales for estimating . . . experiences are really attitude 
scales in disguise.” 1° And so in the literature of attitude analysis 


1s, L. Thurstone and E. J. Chave, The Measurement of Attitude (University of Chi- 
cago Press, Chicago, 1929). 

17 See an application of this in Stuart C. Dodd, “A Social Distance Test in the Near 
East,” American Journal of Sociology, Vol. 41, No. 2 (September 1935), p. 195. 

%C. R. Pace, “A Situations Test to Measure Social-Political-Economic Attitudes,” 
Journal of Social Psychology, Vol. 10, No. 3 (August 1939), pp. 331-344. 

wR. N. Ford, “Scaling Experience by a Multiple-Response Technique: A Study of 
White-Negro Contacts, ” American Sociological Review, Vol. 6, No. 1 (February 1941), 
p. 21. 
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we find much space devoted to the construction of areas by means 
of different types of items and check lists. All of these different 
question forms represent only differences in techniques of observing 
the attitude universe. There is no a priori reason to assume that 
if the attitude universe is, say, attitude toward the British, any one 
way of securing observations of this universe is better than any 
other. In fact, if the universe is scalable, all types of questions 
may be used indiscriminately with the same result as far as the 
rank order of individuals in that universe is concerned. If, how- 
ever, one series of items (for example, questions about past experi- 
ences with the British) does not scale with another series of items 
concerning the British (for example, behavior in hypothetical situ- 
ations), then these two series may be regarded as two scalable sub- 
universes of a larger but nonscalable universe. 

Observations made in public opinion polls. It is with regard to the 
manner of observation that public opinion polls have, perhaps, been 
most often differentiated from attitude scales. A question concern- 
ing public opinion, it is argued, is more specific and controversial 
than an attitude question. This specificity has led public opinion 
pollsters to place a great deal of stress upon the “unbiased” con- 
struction of a single meaningful question. Since the validity of the 
poll question depends upon its manifest content, a great deal of con- 
cern is shown over the “‘meaning” of the single question. It is felt 
that practically every word must be tested for understandability, 
bias, single meaning, etc., while check-list answer categories must be 
presented in reverse form or combined in different ways, etc. Actu- 
ally, it is this susceptibility of the single poll question to slight 
changes in question wording that reinforces the position of the 
present approach that attitude questions and opinion questions repre- 
sent simply different forms of observation. 

Any single opinion question is only a sample of one from the whole 
universe of questions in the opinion area. With regard to dif- 
ferences in “specificity” between attitudes and opinions, it is our 
position that any area of behavior can usually be regarded as a sub- 
universe of some larger area, and can itself often be divided further 
into subareas. In other words, there is no means of determining 
when an attitude is specific enough to be called an opinion. Our 
definition of both involves the determination of verbal or nonverbal 
behavior in some area, which may be relatively specific or relatively 
general. In either case, of attitudes or of opinions, validity will 
depend upon the adequate sampling of the entire universe of ques- 
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tions in the area of interest. Scale analysis provides an objective 
test of whether or not any particular opinion poll question contains 
but a single dimension of meaning that is common to all similar 
questions that could have been asked in its place. 

The examples of scales to be given later happen to comprise ob- 
servations made by means of questionnaires. It should not be in- 
ferred, however, that scaling refers only to that technique. Scale 
analysis is a formal analysis, and hence applies to any universe of 
qualitative data of any science, obtained by any manner of observation. 


Summary 


1. The need for scale analysis arises out of the fundamental prob- 
lem of attitude scaling and opinion polling of how to determine the 
dimensions of meaning which the questions asked have for the re- 
spondents. Scale analysis affords a rigorous test for the existence 
of single-meaning for an area and provides a rank order of indi- 
viduals for such areas as are found scalable. 

2. Basic to the present scale theory is the concept of sampling 
the attitude or opinion universe. An unlimited number of ques- 
tions could be asked in any area; the problem is one of selecting a 
sample of questions which are representative of all possible questions 
that might have been asked. Scale analysis affords a test of this 
sampling. 

3. Scale analysis applies equally well to the study of attitudes 
as to the study of opinions. More generally, any technique of ob- 
servation—questionnaires, interviews, participant observation, etc. 
—of any form of verbal or nonverbal behavior which is qualitative, 
yields data which may be studied by scale analysis. 

4. Scale analysis tests the hypothesis that a group of people can 
be arranged in an internally meaningful rank order with respect to 
an area of qualitative data. A rank order of people is meaningful 
if, from the person’s rank order, one knows precisely his responses 
to each of the questions or acts included in the scale. 

5. More precisely, the multivariate frequency distribution of a 
universe of attributes or items for a population of objects or people 
is a scale if it is possible to derive from the distribution a quanti- 
tative variable with which to characterize the objects such that 
each attribute in the universe is a simple function of that quanti- 
tative variable. 

6. The scalogram board technique for determining the existence 
of a scale involves two basic steps: (1) the questions and answer 
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categories are ranked in a preliminary order of extremeness with the 
“most extreme” category, i.e., the one which is endorsed by fewest 
people, placed first and the other categories following in decreasing 
order of ‘‘extremeness,’”’ and (2) the people are ranked in order of 
“favorableness” with the ‘‘most favorable’ persons, i.e., those who 
answer all questions “favorably,” placed first and the other indi- 
viduals following in decreasing order of “‘favorableness.’’ The re- 
sulting pattern, if a scale is present, will be a parallelogram. 

7. There is an unambiguous meaning to the order of scale scores 
and the order of categories within each item. An object with a 
higher score than another object is characterized by higher, or at 
least equivalent, values on each attribute. Similarly, one category 
of an item is higher than another if it characterizes persons all of 
whom are higher on the scale. 

8. From the multivariate distribution of a sample of attributes 
for a sample of objects, inferences can be drawn concerning the 
complete distribution of the universe for the population. 

a. The hypothesis that the complete distribution is scalable can 
be adequately tested with a sample distribution. 

b. The rank order among objects according to a sample scale is 
essentially that in the complete scale. 

9. Perfect scales are not found in practice. 

a. The degree of approximation to perfection is measured by a 
coefficient of reproducibility, which is the empirical relative frequency 
with which values of the attributes do correspond to intervals of a 
scale variable. 

b. In practice, 90 per cent perfect scales or better have been 
used as efficient approximations to perfect scales. 

10. The predictability of any outside variable from the scale 
scores is the same as the predictability from the multivariate distri- 
bution with the attributes. The zero-order correlation with the 
scale score is equivalent to the multiple correlation with the universe. 
Hence, scale scores provide an invariant quantification of the attributes 
for predicting any outside variable whatsoever. 

11. Scales are relative to time and to populations. 

a. For a given population of objects, a universe may be scalable 
at one time but not at another. 

b. A universe may be scalable for one population but not for 
another, or it may not be scalable for an entire population, but scal- 
able for a subpopulation. However, if a universe is scalable for an 
entire population, it will be scalable for all major subpopulations. 
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c. Comparisons with respect to degree can be made only if the 


same scaling obtains in both cases being compared. 
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CHAPTER 4 


THE SCALOGRAM BOARD TECHNIQUE 
FOR SCALE ANALYSIS' 


Rom the previous examples of ideal scales, it is apparent that 

techniques to determine whether or not the basic parellelogram 
pattern of responses is present in a given set of data can be very 
simple. The multivariate frequency distribution of responses should 
be such that from the scale score one can tell exactly what responses 
were made to each of the questions in the scale. This will be the 
case where the responses to each question can be expressed as a 
simple function of the scale scores. To determine whether or not a 
series of questions forms a scale, it is only necessary to test to what 
extent question responses are reproducible from scale scores. 

This may be done in several different ways. Four specific tech- 
niques which have been used successfully are: (1) the scalogram 
board technique, (2) the Cornell trial-scoring and graphic tech- 
nique,” (3) the tabulation technique,’ and (4) the least squares 
method.‘ All four techniques produce essentially the same results, 
since they have the same basic theory. The last three methods 
have been presented elsewhere and will not be discussed here. The 
first method, the scalogram board technique, has received the most 
use to date, being the technique employed by the Research Branch. 
This is the method that will be described in detail in the present 
chapter. 

1 By Edward A. Suchman. 

2 Louis Guttman, “The Cornell Technique for Scale and Intensity Analysis,” Edu- 
cational and Psychological Measurement, Vol. 7, No. 2 (Summer 1947), pp. 247-280. 

* Ward H. Goodenough, “A Technique for Scale Analysis,” Educational and Psycho- 
logical Measurement, Vol. 4, No. 3 (1944), pp. 179-190. 

4 Louis Guttman, “The Quantification of a Class of Attributes: A Theory and Method 
of Scale Construction,’ in P. Horst et al., The Prediction of Personal Adjustment (Social 
Science Research Council, New York, 1941), pp. 319-348. 
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The Scalogram Board 


The scalogram board is simply a device which permits the shift- 
ing of the rank order of both respondents and question categories. © 
By arranging answer categories in ascending order of frequency of 
“favorable” responses and by arranging respondents in descending 
order of “‘favorableness,” it is possible to determine visually whether 
the required parallelogram pattern exists. This ability to shift 
item order and respondent order at will also permits the use of the 
scalogram board for many purposes other than scale analysis. Many 
problems involving an analysis of multivariate frequency distribu- 
tions of qualitative data can be successfully handled visually by 
means of the scalogram boards. 

Two scalogram boards are used, in order to provide both vertical 
(for people) and horizontal (for response category) mobility. The 
two boards are identical in construction. Each board serves as a 
base upon which 100 removable wooden slats are held in place by a 
frame attached to the base. When the frame is off, these strips of 
wood can be removed and shifted up or down the length of the board. 
Each strip has 100 holes bored equidistant and sufficiently deep to 
hold an 118 calibre shot which has been silver-coated. The shots 
are used to indicate each individual’s response to each question. 
The board and slats are stained black so that the contrast of the 
silver balls against the black board enables one to photostat each 
piece of work. Since each board consists of 100 movable wooden 
strips, each strip containing 100 holes, the board is equipped to 
handle the responses of 100 individuals to items with a maximum Vv 
total of 100 categories. 

Legend for Figures 1, 2, and 3 
A, B, C—— Frames attached to base piece 
(hold slats in place) 
D  —— Removable frame 


(When this frame is off, slats can be taken 
out and moved up or down.) 


E — Fill-in strip 


(used to tighten slats and align them for 
transfer to a second board) 
F — Alignment strip 
(used to align boards 1 and 2 for transfer) 
G —— Movable slats—numbered 1 to 100 
H — Screw and nut 
(used to tighten and align slats) 
I — Base 


’ The scalogram board was designed by Louis Guttman. 


Figure 2. 


Scalogram board (showing slats ready to be shifted). 
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STRIPS & PIECES 
ABCD=\y THICK 
BASE PIECE $33 THICK 


Figure 3. Diagram for construction of scalogram board 
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Figure 4. Diagram for construction of removable slats. 
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Pictures of the board are shown in Figures 1 and 2, while a dia- 
gram for the construction of the board is given in Figures 3 and 4. 
Although these are not shown in the pictures, it has been found 
helpful to attach strips numbered from 1 to 100 across the top of 
the board (Frame A) and down the side of the board (Frame B) to 
show the rank order of respondents and items. 


FEEDER ARM 6 


Figure 5. Dropper for insertion of balls. 

How the Dropper Works: The magazine is kept filled with balls. As Feeder Arm A 
is preased in, Feeder Arm B is pushed back and a ball is released. Feeder Arm A in the 
meantime prevents other balls in the barrel and magazine from being released. Relax- 
ing pressure op Feeder Arm A permits it to spring back into place, closing the barrel 
opening by means of Feeder Arm B, and permitting another ball to fall between Feeder 
Arms A and B ready for release when Feeder Arm A is pressed in again. 


The balls are inserted into the holes by means of a dropper which 
expels one ball at a time. The design of such a dropper’ is shown 
in Figure 5. To remove the balls from the holes, it is only neces- 
sary to pass over them with a magnet. 

The space between rows of holes is the same as between columns 
of holes on each board. This is so the holes of a column of one 
board will fit precisely over the holes of a row of the other board. 
Balls can be transferred from the holes of one board into the holes 
of the second board simply by putting the empty board face down 
on the board containing the balls, holding both boards together, 


¢ The dropper was designed and constructed by Fred Sheffield. 
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and turning them over. The balls will then fall from the holes in 
Board 1 into the corresponding holes in Board 2. The balls are 
always in only one of the boards. 

In setting up a scaling problem, the balls are first put into one 
board to arrange the individuals in rank order. The second board 
is used afterwards to arrange the categories in rank order. The 
removable slats of the first board permit the shifting of individuals, 
at the same time holding the order of question categories constant, 
while the removable slats of the second board permit the shifting of 
categories with the rank order of individuals remaining constant. 

In order to change from shifting individuals to shifting categories, 
it is necessary to superimpose the two boards so that the slats of the 
two boards run at right angles to each other, and to turn the boards 
over so that the balls fall out of the holes of one board into the holes 
of the second board. For example, if Board 1 is used to arrange 
individuals, each removable slat represents the responses of one 
single individual to all the questions. These slats, let us say, run 
horizontally, that is, they can be moved up or down on the board. 
(See Figure 6.) The order in which the questions and answer cate- 
gories appear is fixed, and the individual respondents can be moved 
up or down without disturbing the question order. 

To change the category order it is necessary to transfer the balls 
to Board 2. This is done by placing Board 2 over Board 1, with the 
slats running at right angles. The two boards are then held to- 
gether and turned over. In this way the balls are transferred from 
Board 1 to Board 2. (See Figure 7.) Again, the presence of a ball 
represents the answer of a respondent to a question. But on this 
board, the movable slats run in the opposite direction and each 
slat represents a question-answer category. The rank order of 
respondents is fixed, and the order of the categories can now 
be changed without disturbing the respondent order. After rear- 
ranging category order, to change back to Board 1 in order to study 
the effect of the changed question order upon the scalogram pat- 
tern, it is only necessary to place Board 1 on top of Board 2, and 
turn the boards over. The change from Board 1 to Board 2, and 
back again, is accomplished very quickly and in this way changes in 
scalogram pattern can be observed at all stages of the ordering 
process. This visual analysis permits one to determine quite easily 
just how closely the arrangement of respondents from most to least 
“favorable” conforms to the desired parallelogram picture. 
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An Example of Scalogram Analysis 


Before presenting an actual example in detail, let us briefly sum- 
marize the main procedures involved in scalogram analysis. 

There are four basic steps in an ordinary scalogram analysis. 
These are: 


ANSWER CATEGORIES 


RESPONDENTS 


Figure 6. Board 1: Arrangement of individuals. (The filled-in holes represent re- 
sponses. ) 


1. The initial arrangement. The responses of a sample of 100 
individuals to the series of questions to be tested for scalability are 
tabulated. An initial rank order of questions is determined from 
the frequencies of ‘‘positive’’ and ‘‘negative” responses to each 
question. The rank order of ‘‘positive’’ response categories runs 
from the question with the lowest “‘positive’ frequency to the 
question with the highest ‘“‘positive”’ frequency. The rank order 
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of ‘‘negative” response categories runs from the question with the 
highest ‘negative’ frequency to the question with the lowest 
“negative” frequency. The rank order of “neutral’’ responses is 
unimportant. With the order of response categories established, 
the answers of the sample of 100 individuals (unordered) are placed 
into the board, each individual’s responses being contained on a 
single removable slat. 


ANSWER CATEGORIES 


iz 
wie l= 

a 

o 
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RESPONDENTS 


Figure 7. Board 2: Arrangement of questions. 


2. The ranking of respondents. Respondents are scored for ‘“‘favor- 
ableness” according to the number of ‘‘positive’” replies they give. 
The slats are then shifted so that respondents are ranked on the 
board from ‘more favorable” to “less favorable.” We now have 
an approximate ranking of respondents from “‘more’’ to “‘less favor- 
able,” and an approximate ranking of questions according to fre- 
quency of responses to “‘positive” and “negative” categories. If _ 
the data are scalable, the required parallelogram pattern should be ~ 
apparent at this stage. 

3. Combining categories. We now transfer the responses from the 
first scalogram board to the second board. This permits us to shift 
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answer categories without disturbing the rank order of respondents. 
The answer categories of each question are shifted so that all cate- 
gories of a single question or item are adjacent. Those answer 
‘categories which overlap each other are combined into a single cate- 
‘gory. Each item is tested for “improvement” and “error.” A new 
question rank order according to frequency of ‘‘positive”’ and “‘nega- 
tive’’ responses is determined after categories have been combined 
and, if necessary, questions with many “errors” eliminated. The 
slats are shifted once more bringing together all “positive” responses 
and all ‘negative’ responses in proper rank order. 

4. Final arrangement. Responses are now transferred back to 
the first board to permit a reshifting of individuals while the rank 
ordering of questions is held constant. Individuals are once more 
ranked as in the second step from “more” to “less favorable.’ The 
resulting arrangement represents the final scalogram picture. 
Errors are counted and the coefficient of reproducibility computed. 

Now let us take an actual example and carry it through these 
four stages of scalogram analysis. 


Step 1. The Initial Arrangement 


A series of questions constituting a sample of an entire universe of 
items is chosen and given a title indicative of the subject matter 
covered. The twelve questions used in the present example deal 
with soldiers’ attitudes toward the Army and are given in List 1. 
The working sample usually consists of 100 cases which are obtained 
by drawing every ‘“‘nth” schedule or card from the total number of 
schedules or IBM cards in the survey. By judging its content, 
the response to each question is designated as ‘‘positive,” ‘“‘neutral,”’ 
or “‘negative.”’ When several middle positions are present, we may 
have “positive-neutral,” “neutral-neutral,’’ or “negative-neutral”’ 
responses. ‘‘No answers” are kept distinct from the responses at 
first. Always, the apparently most favorable responses to a ques- 
tion are designated as ‘‘positive,’’ and the least favorable as ‘“nega- 
tive.” 


List 1 
Altitude toward the Army 
(Questions are numbered as they were in the questionnaire.) 


26. All things considered, do you think the Army is run about as effi- 
ciently as possible, or do you think it could be run better? (Check one) 
It is run about as well as possible, everything considered 


27. 


28. 


29. 


30. 


31 


32. 


33. 


. 
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2 ______ It. could be run somewhat better 
3 It could be run a lot better 


Do too many of the things you have to do in the Army seem unneces- 
sary? (Check one) 

1 No, not too many of them seem unnecessary 

2 Yes, too many of them seem unnecessary 


In general do you think the Army has tried ita best to see that men 
get as square a deal as possible? (Check one) 

1 Yes, it has tried its best 

2 _____. It has tried some but not hard enough 

3 It has hardly tried at all 


In general do you feel you yourself have gotten a square deal from the 
Army? (Check one) 

1 ____ Yes, in most ways I have 

2 In some ways, yes, in other ways, no 

3 No, on the whole I haven’t gotten a square deal 


Do you feel that the Army is trying its best to look out for the welfare 
of enlisted men? (Check one) 
1 Yes, it is trying its best 


2 It is trying some, but not hard enough — 

3 It is hardly trying at all 
In general how interested do you think the Army is in your welfare? 
(Check one) 

1 Very much 

2 Pretty much 

3 Not so much 

4 Not at all 


What do you think of the statement that “The Army makes a man of 
you”? (Check one) 
1 _____ There’s a lot to it 


2 There may be something to it, but I’m still doubtful 
3 There is not much to it 
4 No opinion 


In the Army, some jobs are naturally harder and more dangerous than 
others and the Army has to put men where it thinks they are needed. 


Considering everything, do you think the Army is trying ita best to 
see that, as far as possible, no man gets more than his fair share of the 
hard and dangerous jobs? (Check one) 


1 Yes, it is trying its best 
2 It is trying some, but not hard enough 
3 It is hardly trying at all 


Do you think the Army is trying its best to see that the men who 
have the hard and dangerous jobs get the special consideration and 
breaks they deserve? (Check one) 


1 Yes, it is trying its best 
2 It is trying some, but not hard enough 
3 It is hardly trying at all 
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35. On the whole, do you think the Army gives a man a chance to show 
what he can do? (Check one) 


1 A very good chance 

2 A fairly good chance 
3 _____ Not much of a chance 
4 No chance at all 

5 Undecided 


36. In general, how well do you think the Army isrun? (Check one) 
1 It is run very well 


2 It is run pretty well 
3 It is not run so well 
4 It is run very poorly 
5 Undecided 


37. Do you think when you are discharged you will go back to civilian life 
with a favorable or unfavorable attitude toward the Army? (Check one) 


1 _____ Very favorable 

2 Fairly favorable 

3 About 50-50 

4 Fairly unfavorable 
5 Very unfavorable 


The terms used here like ‘‘positive,”’ “negative,” and the like, are 
only to indicate a trial ordering of the categories within each item. 
They are not meant to imply being on one side or another of a 
“zero” point. Actually, the scale analysis could be performed 
“blindly,” without looking at the content of the categories. The 
technique is one of successive approximations which will converge 
anyhow to the proper scale ordering—if a scale exists. Judging 
category order within an item beforehand simply enables the first 
approximation to be a very good one, so that not many more itera- 
tions are required. Working “blindly” would involve many more 
iterations ordinarily. The preliminary ranking of categories by 
judging their content is not a theoretical necessity; it is only an ex- 
tremely helpful labor-saving device. If it is uncertain how to rank 
some categories a priori, which may often be the case with the 
hyphenated ‘‘neutral” categories and with ‘‘no answer,” these can 
be given an arbitrary order, which will then be automatically re- 
vised in the course of further steps in the analysis. 

The pattern of our initial scale setup is determined by the fre- 
quencies of the “positive” and “negative’’ categories of each ques- 
tion. The scalogram is built from left to right, column by column, 
each column containing a single category of response for a single 
question. The first column consists of the “positive” category 
with the fewest responses, and so on until all of the “positive” re- 
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sponses of the questions in the series are represented. Here a 
column is left vacant to set off the “positive” from the “neutral” 
responses, which are the next group placed in the board. The 
“neutral” responses may follow the question, or column, order of 
either the “‘positive” or the ‘‘negative’’ responses. If any of the 
questions have more than one “neutral” response, the “‘positive- 
neutral’’ responses of all the questions are placed first, following the 
column order assigned the respective “positive” categories. Im- 
mediately following are the ‘‘neutral-neutrals,’”’ and then the ‘‘nega- 
tive-neutrals,” following the column order assigned the “negatives.” 
Another vertical column is left vacant and is followed by the ‘“‘nega- 
tive” responses. The ‘negative’ category order is determined 
again on the basis of frequency of ‘‘negative’’ responses. The 
categories in the ‘“‘negative’’ section are placed from left to right in 
order of highest frequency to lowest. When ‘‘no answers” occur, 
they are placed a few columns to the right of the scale proper. If 
it is desired to test whether a question of different content correlates 
with the scale, its responses may be set in further on the right side 
of the board, with the columns ranging from ‘“‘positive’”’ to “nega- 
tive.” 

Each of the slats holds the answers to every question on the board 
for one man. Since each horizontal slat bears a number from 1 to 
100, and since each schedule or IBM card is numbered, we can 
readily identify any one of the men for further study or reference. 

The scalogram picture for Step 1 shows the initial arrangement. 
The first column of numbers on the left identifies the rank order of 
respondents, while the second column identifies the sample of 100 
respondents who have been selected at random for scale analysis. 
The numbers across the top of the scalogram refer to the preliminary 
order assigned the categories. At the top of the chart the question 
number and the code number for the response category is given for 
each column. The frequency for each column is given at the bot- 
tom of the chart. Since the total number of answer categories for 
the twelve questions was 43, only half of the columns of the whole 
board are pictured. Columns 1 to 12 represent the most ‘‘positive”’ 
answer category for each of the 12 questions used in this scale. 
Columns 36 to 47 represent the most ‘‘negative’”’ answers for these 
12 questions. Between these two columns, we have the inter- 
mediate answer categories arranged according to “‘positive-neutrals”’ 
(columns 14 to 21), ‘“‘neutrals’”’ (columns 23 to 28), and “‘negative- 
neutrals” (columns 30 to 34). The frequency of “positive” and 
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“‘negative” responses for each of the questions in the scale has been 
tabulated and answer categories have been placed in the board in 
ascending order of frequency for ‘‘positive”’ categories, and descend- 
ing order of frequency for “negative” categories. The ordering of 
“neutral” categories may follow the order of either the ‘‘positive”’ 
or “negative” categories. 

This initial picture, therefore, represents the responses of a ran- 
dom sample of one hundred individuals to twelve questions arranged 


according to the frequency of “positive” and ‘‘negative’’ answers. : 


The responses of respondent number 57, for example, can be read 
from the scalogram as follows: slat 57 has balls in columns 7, 11, 
12, 14, 18, 19, 20, 21, 28, 30, 31, and 36, which represent the follow- 
ing replies to each of the twelve questions: 26-1, 30-1, 33-1, 37-2, 
29-2, 31-2, 36-2, 28-2, 34-2, 35-3, 32-2, 27-2.7 Similarly, it is 
possible to see which individuals answered question 35-1: i.e., per- 
sons 1, 21, 54, 58, 64, 69, 71, 73, 74, 88, 95, and 99. 

An alternative way of arriving at the scalogram for Step 1 would 
be to use Board 2 in which each slat stands for an answer category 
and to put one question in the board at a time with all the answer 
categories for any one question grouped together. The answer 
categories can be shifted from right to left into the desired question 
order, and the balls can then be transferred to Board 1 in prep- 
aration for Step 2—the ranking of respondents. 


Step 2.. The Ranking of Respondents 


The initial setup completed, we start moving the slats, or re- 
spondents, up and down the board to arrange them in rank order 
of “favorableness.”” The men who are “favorable” on all twelve 
questions will have balls in the first twelve columns; these men are 
moved to the top. After these men come those who are “‘favorable”’ 
on eleven questions, then ten, etc. An approximate method which 


has proved useful during this initial ranking is to use simple weights, 


e.g., to give “‘positive’’ answers a weight of 4, “‘positive-neutrals” 3, 
“neutrals” 2, ‘‘negative-neutrals” 1, and ‘‘negatives’’ 0, and then to 
rank individuals according to their scores. An additional aid is to 
rank individuals so that as many balls as possible appear in a solid 
streak for each of the answer categories. As will be shown in Step 
3, the number of errors is computed by marking off these “solid 
streaks” with cutting points on the rank order, and then counting 
as errors the number of individuals who fall outside these cutting 


7 Code numbers will be used throughout to indicate answer categories. 
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points. The previous chapter has already shown how these “‘solid 
streaks” are predictable from a knowledge of the marginal frequency 
distribution of replies. 

Finally, within each of the scale types separately,® individuals are 
ranked so that those individuals who are missing balls in the higher 
numbered columns among the “‘positives” and in the lower num- 
bered columns among the “‘negatives” are placed above those indi- 
viduals who are missing balls not so close to the center columns. 
The purpose of these arrangements is to secure as many perfect or 
near perfect scale types as possible. Since the amount of error is 
computed by counting up deviations from the ideal scale pattern, 
these criteria of respondent order are aimed at permitting maximum 
predictability of responses to all twelve questions from a knowledge 
of the scale score. The basic criterion which determines whether 
an individual should be shifted up or down is the effect this shift 
will have upon the number of errors. How errors are computed 
will be shown in Step 3. 

The scalogram for Step 2 shows the results of the first ranking of 
respondents. Respondents with ‘more favorable” answers have 
been moved to the top of the board, while respondents with “less 
favorable” answers have been moved to the bottom. The first 
column of numbers on the left shows the rank order of respondents, 
while the second column of numbers shows the revised arrangement 
of the respondents’ identification numbers which results from the 
ranking. Answer categories are once more coded at the top of the 
picture. The rank order at this stage of the arrangement need not 
be done too scrupulously. The “neutral” replies especially can be 
handled relatively unsystematically. With practice, the research 
worker will find that the visual pattern is enough to determine a 
good initial rank order of individuals. However, let us take sev- 
eral specific examples of why certain individuals were ranked above 
others to illustrate the general principles involved. 

1. “Weighting.” Rank orders 9 versus 10 (respondents 91 versus 
80). Respondent 91 is placed above respondent 80 because, al- 
though both respondents show 9 “‘positives,”’ person 91 has a weight 
of 7 in the other 3 categories (two “‘positive-neutrals” and one ‘“‘nega- 


* We define “scale type’’ for the purpose of scalogram analysis as that perfect scale 
type which the given individual most closely approaches with the least number of errors. 
If there is more than one perfect scale type to which the given individual approaches 
most closely, he is classified as belonging to that scale type which best maintains the 
“golid streaks.”’ 
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tive-neutral’’) while person 80 has a weight of 5 (one “‘positive- 
neutral,’’ one “neutral” and one “negative” response). 

2. “Solid streaks.’’ Rank orders 52 versus 67 (respondents 45 
versus 86). Both respondents 45 and 86 have the same number of 
“positives,” and yet, despite the fact that 45 has a lower total 
weight (28) than 86 (35), he is placed above person 86. The reason 
for this arrangement can be seen by looking at column 11 across the 
top. If person 86 were placed above person 45, this would intro- 
duce an empty hole into the solid streak of balls in this column. 

3. “Errors close to center columns.’’ Rank orders 7 versus 8 (re- 
spondents 95 versus 58). Both respondents have the same number 
of ‘positives’? and the same total weights. However, person 95 is 
placed above person 58 because the missing ball is in a higher col- 
umn (column 6) than for 58 (column 2) and, therefore, closer to the 
center columns. 

The above examples are intended only to show some of the pro- 
cedures by which a parallelogram pattern with fewer errors will be 
obtained. For the most part, this ranking of individuals can be 
done visually, without scoring. The pattern becomes especially 
clear when only dichotomous items are used. It is sometimes pos- 
sible to tell at this very early stage that the items will not form a 
scale, in which case there is no need to proceed with the next steps. 


Step 3. Combining Categories 


After a first approximate ranking of individuals, the problem be- 
comes one of combining answer categories. As in the case of rank- 
ing individuals, the basic criterion for these combinations is the 
number of errors or deviations from the ideal or expected scale pat- 
tern. Since making these combinations involves shifting the an- 
swer categories, the balls are transferred to the second board to 
permit the shifting of answer categories while the rank order of 
individuals is held constant. Holding the two boards together so 
that the slats of one are perpendicular to those of the other, we turn 
them over. The balls fall into Board 2, permitting the shifting of 
answer categories columns. In order to study each question sep- 
arately the slats for each individual question are brought together. 
This is the pattern shown in the scalogram picture for Step 3. 

Before proceeding with a specific example, let us review what is 
meant by the combination of answer categories. Suppose we are 
ranking 100 individuals in order of “favorableness’ based upon 
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their responses to a single question whose marginal frequency dis- 
tribution is as follows: 


Answer category Number 
Strongly agree (“‘Positive’’) 10 
ee (“Positive-Neutral’’) 20 
Indifferent (“Neutral’’) 20 
Disagree (“Negative-Neutral’’) 25 
Strongly disagree (‘‘Negative’’) 25 
100 


If we assume that this question itself forms a perfect scale, 
the first ten ranks would be occupied by the respondents who 
checked the “positive”? answer category, the next twenty ranks 
(from 11 to 30) by the respondents who checked the “positive- 
neutral” category, and so on. We could mark off these ranks by 
“cutting points” along the rank continuum. From these cutting 
points along the rank continuum, we can predict the respondent’s 
answer to the question. For example, a respondent whose rank 
order is anywhere from 11 to 30 would be predicted as checking the 
“positive-neutral” category. Now if we hypothesize that this ques- 
tion belongs to a scalable universe, we would expect this prediction 
of answer category from rank order to be true for each of the single 
questions in the series of questions which go to make up the scale. 
Each answer category to each question in the scale will serve to de- 
fine an additional cutting point. In a perfect scale, all responses to 
the answer category will fall between the cutting points of that 
category. Any response which falls outside the cutting points will 
be called an error of reproducibility; we would predict the wrong 
response from knowing that the rank order of the respondent fell 
between the prescribed cutting points. This is how we compute 
the error for each of the answer categories shown in the scalogram 
for Step 3. We mark off the cutting points and then count the 
number of responses which fall outside the cutting points. Cutting 
points should mark off consecutive ranks; this means that “solid 
streaks’ in adjacent categories should not overlap and should not 
omit any ranks. Only one cutting point should separate any two 
adjacent categories. 

We now come to an important point. The cutting points should 
separate answer categories which are distinctly indicative of dif- 
ferent degrees of “favorableness” or “‘unfavorableness.” Thus the 
respondents in the “‘positive-neutral’’ category should be distinctly 
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less ‘favorable’ on the scale continuum than the respondents in 
the “positive” category, if the rank order is to be reproducible. 
However, we often find this not to be the case. Instead of the 
rank order of individuals who answer “positive” running from 1 to 
10, and the rank order of respondents who answer “‘positive-neutral”’ 
running from 11 to 30, we find the two mixed rather indiscriminately. 
We could not predict without a large number of errors whether a 
person whose rank order was from 1 to 30 answered ‘‘positive’’ or 
“‘positive-neutral.”’ However, and this is the important point, we 
could predict very well that his answer would be either ‘“‘positive’’ 
or “‘positive-neutral.”” Thus by combining the two categories into 
one we can set one cutting point between 30 and 31 and predict for 
the individuals from 1 to 30 that their answers would be either 
“positive” or ‘positive-neutral.” This is what is meant by com- 
bination—the bringing together of two or more answer categories 
which cannot be reproduced separately into one combined cate- 
gory which can be reproduced. 

The reason for “positive” and “‘positive-neutral’”’ categories fail- 
ing to discriminate between meaningful degrees of “‘favorableness” 
may be due to the verbal habits of people. Some people who are 
basically in the same position on the scale continuum will say 
“strongly agree’ where others will say “agree.’”? This problem of 
verbal habits will be discussed in greater detail in a later analysis of 
verbal intensity. 

Let us look at question 36, for example. The columns 3, 20, 24, 
33, and 42 have been brought together so that all the answer cate- 
gories to question 36 are juxtaposed. The same thing has been 
done for each of the other questions. We are now ready to make 
the necessary combinations of answer categories. 

Measuring improvement. Whether or not the amount of error 
indicates that two or more answer categories should be combined 
can be determined by examining how much the reproducibility of 
responses would be improved by making the combination. 

A way of measuring improvement is to count up the number of 
errors, both of responses which fall outside the cutting point range 
and responses which fail to fall within the cutting point range, and 
to compare this to the number of responses falling within the range 
(nonerrors). A working rule is that the number of nonerrors should 
exceed the number of errors for a category to be kept separate. 

First, it is necessary to compute the number of errors which occur 
in each column. Error is determined for each column by counting 
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up the number of responses in that column which fall outside the 
cutting points of that answer category, plus the number of re- 
sponses within these cutting points which do not fall in the column 
under consideration. Another way of stating this problem would 
be to ask how well does the rank order of an individual permit one 
to predict whether this individual checked the answer category for 
which the error is being computed. For example, we return to the 
response categories to question 36 in columns 3, 20, 24, 33, and 42. 
In column 3 the “positive” responses run from ranks 1 to 19 (the 
first column of figures given on the left indicating the rank order of 
respondents). This places the cutting point below rank 19, and 
we now count as errors for improvement any responses which occur 
below rank 19 or any responses which do not occur from 1 to 19 in 
column 3. This column, therefore, has four errors occurring in 
ranks 30, 31, 34 and 60. Looking at the “‘positive-neutrals” to 
question 36 in column 20, we begin with rank 20 and cont nue to 
rank 83 to indicate those individuals who in a perfect scale would 
have replied in this category to question 36. This time we find six 
errors within this range (in ranks 30, 31, 34, 53, 60, and 75) and 
two errors falling outside this range (in ranks 93 and 99). Column 
33 (category 3) begins at rank 84 and ends at 91, with three re- 
sponses falling outside this range (ranks 53, 75, and 97) and two 
errors occurring within it (ranks 85 and 87). Column 24, contain- 
ing the “neutral” answer to question 36 appearing in rank 92, may 
be considered as beginning and ending at rank 92, thus showing no 
errors within this range. It is not necessary that the “neutral” 
answer be placed between the “‘positive-neutrals”’ and the “negative- 
neutrals,” if other placement will reduce the overall scale error. 
Column 42 begins at rank 93 and runs to 100, showing three errors 
within (ranks 93, 97, and 99) and two errors outside (ranks 85 and 
87) this range. In this manner the errors are computed for all 
columns. These errors are given at the bottom of Scalogram 3 in 
the row marked ‘Errors for Improvement—Uncombined.”’ 

We are now ready to see if certain answer categories should be 
combined. To tell this we examine each column and decide whether 
the category is sufficiently reproducible to stand by itself, or whether 
the pattern of errors indicates that it does not permit reproducibility 
unless combined with the next more or less ‘favorable’ category. 
The criterion to be applied is: to what extent will the combination 
of answer categories increase reproducibility from the rank order of 
the respondents? The problem is essentially one of finding out 
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whether to combine the “‘neutral” categories, “positively” or “‘nega- 
tively,” if at all. As was discussed in the previous chapter, there 
are certain advantages to keeping as many categories intact as pos- 
sible. Additional categories increase the number of scale types be- 
tween which the sample of items can discriminate. Furthermore 
the possibility that a sample of questions chosen from a nonscalable 
universe will apparently form a scale is greatly reduced as the num- 
ber of answer categories that can be kept separate increases. 

A practical procedure is to count the number of errors in each 
category, and if this number is relatively small (e.g., if it is less than 
the number of nonerrors) then the category may be kept separate. 
If the category has too many errors to stand by itself, then an at- 
tempt is made to reduce this error by combination with an adjacent 
category. If no combinations will reduce the number of errors be- 
low that of the nonerrors, then the conclusion is that the question is 
not a simple function of the scale score. This indicates the presence 
of one or more additional variables and does not permit a rank order- 
ing of individuals on a single continuum. 

Once again, let us look at question 36 for a specific example. 
Category 1 (column 3) can stand by itself since it makes only 4 
errors of reproducibility from the rank order as compared with 19 
nonerrors. Similarly, category 2 (column 20) makes only 8 errors 
compared to 58 nonerrors and can also stand by itself. Category 5 
(column 24) contains only one case and so should be combined. 
Combining it with category 2 serves only to increase the amount of 
error in 2, whereas it can be combined without increasing errors with 
category 3. Category 3 (column 33) has 5 errors, but contains 
only 6 nonerrors, and so it just barely satisfies our criterion. Look- 
ing at category 4 (column 42) we see that this category could not 
stand by itself (five errors and five nonerrors). However, com- 
bining categories 3 and 4 and treating responses to these two 
categories as if they were a single category produces only 5 errors 
compared to 14 nonerrors. Combining category 5 with categories 
3 and 4 raises the number of nonerrors to 15 and reduces the num- 
ber of errors to 4. The combination of categories 3, 4, and 5, there- 
fore, reduces the number of errors from 10 to 4. 

By the same process of reasoning, we make combinations for all 
the other questions in the scale as shown in the last row of the scalo- 
gram picture for Step 3. By visual analysis of the overlapping of 
balls within the answer categories of each question, it can be deter- 
mined very rapidly how best to combine the categories so as to 
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minimize the ratio of errors to nonerrors. To combine columns it 
is necessary only to remove the balls from the columns to be com- 
bined with a magnet and to transfer them all to a single column. 

An especially good example of how scalogram analysis aids in the 
determination of how to combine answer categories, and incidentally 
indicates the weakness of assuming that each answer category actu- 
ally represents a meaningful change in degree, can be seen from 
question 37. The balls in the answer categories 2, 3, 4, and 5, 
despite the apparently different manifest intensity of the categories, 
are so overlapping as to belong together. This would seem to indi- 
cate that the respondents are not interpreting the different answer 
categories as representing degrees of “favorableness,’’ as evidenced 
by the fact that a person’s answers cannot be reproduced adequately 
from his rank order on the whole scale. In the same way it is pos- 
sible, for example, to see that the “neutral” category in question 34 
should be combined with the “negative” category. When the deci- 
sion on how to combine answer categories is an important one, it is 
often desirable to increase the sample of respondents to about two 
hundred, to provide greater stability from the point of view of sam- 
pling of people. 

Another principle to be observed in making combinations is that 
of the differentiation of new scale types. As discussed in the previ- 
ous chapter, the number of scale types depends upon the number of 
different cutting points on the rank order. . Two questions with the 
saine marginal frequencies of responses will cut the scale rank order 
in approximately the same place and therefore not contribute to the 
ability of the scale to discriminate. It may sometimes be worth 
while to increase the error slightly in order to secure a combination 
of categories which would produce different marginal frequency 
distributions. - The goal is to secure a series of questions which per- 
mit the widest range of expression from 0 to 100 per cent ‘‘favor- 
able.” 

Combining answer categories results in a change in “positive” 
and ‘“‘negative’’ frequencies and therefore the categories or columns 
must be reordered. A more rigorous use of the scalogram board 
would require that question order and respondent order be checked 
after each single combination made. However, it has been found 
that completing all combinations at once and then reordering pro- 
duces satisfactory results. When the category or column order has 
been reordered with the ‘positive’ categories ranging from low to 
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high frequency, and the “negative” categories from high to low (as 
in Step 1), the balls are ready to be returned to the first board for 
the final step. 


Step 4. Final Arrangement 


The combination and reordering of categories now requires a 
change in the rank ordering of individuals. To shift individuals up 
and down, it is necessary to return the balls to the first board. The 
shifting of respondents is done in exactly the same way as in Step 2. 
Respondents with “‘more favorable” answers are moved to the top 
of the board, while respondents with “‘less favorable” answers are 
moved to the bottom. Once again the basic criterion which deter- 
mines whether an individual should be shifted up or down is the 
number of errors which result from the shift. Due to the decreased 
number of answer categories, the effect of any shift upon the num- 
ber of errors is much more easily discernible than in Step 2. The 
same criteria of “weighting,’’ “solid streaks,’’ and ‘‘closeness of error 
to center columns” serve as aids to the ranking of individuals. 

One final check serves to help obtain the least amount of error. 
The ideal scale types are marked off by means of the cutting points 
for each column and each person is checked to see if moving him to 
a lower or higher scale position would reduce the number of errors 
by which he deviates from the ideal scale type. In general, if a 
respondent can fit equally well into any of several scale types, he is 
put into that scale type closest to the middle of the rank order. 

The method of determining ideal scale types has been discussed 
in the previous chapter. The number of types consists of the num- 
ber of answer categories, minus the number of questions, plus one. 
In the present example this would be: 


(2+24+3424+24+24+24+24+242+42+42)-12+1= 


Perfect scale types would be represented as in Figure 8. 

The scalogram for Step 4 shows the obtained pattern after the 
individuals have been reordered following the combination and re- 
ordering of categories in Step 3. Comparison of the obtained 
parallelogram with the ideal parallelogram permits one to see 
clearly the errors of reproducibility. Note that the reordering 
of individuals changes the number of errors for each category 
from that of the scalogram for Step 3. 
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Figure 8. Perfect scale types. 
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Test for Scalability 


We are now ready to test the scalogram for the existence of a 
satisfactory scale. Four main criteria are used: (1) the coefficient 
of reproducibility, (2) the number of items and response categories, 
(3) the range of marginal frequencies, and (4) the pattern of error. 

1. The coefficient of reproducibility. The coefficient of reproduci- 
bility is computed by means of the following formula: 


Coefficient of = _ 1— number of errors 
Reproducibility number of questions X number of respondents 
asics 134 
a 12 X 100 
= 89 


For computing the coefficient of reproducibility, only one response 
is counted for each question. A single response to a question is 
reproduced from the scale score either correctly or incorrectly. This 
means that only responses which occur outside the scale type are 
counted as errors. It will be remembered that in computing errors 
for improvement (Step 3) the absence of responses within the scale 
type and the occurrence of responses outside the scale type were 
both counted. For reproducibility, errors are counted only once, 
and the easiest way to spot these errors is to count the number of 
balls outside the cutting points. The number of errors for each 
column are shown in the last row of the scalogram for Step 4. 

As discussed in the previous chapter, the lower acceptable limit 
for the coefficient of reproducibility has been placed at about .90. 

The coefficient of reproducibility, while a necessary test of the 
existence of a scale, is not a sufficient one if only a sample of items is 
used. These other conditions must be considered: the number of 
answer categories left after combination, the item marginal fre- 
quencies, and the pattern of error. 

2. The number of answer categories. If more than ten items are 
used in the sample, as-in-the-present-example, then ordinarily the 
final number of categories does not matter; all items might be dichot- 
omized. However, the more categories that can remain uncom- 
bined, the more credible is the inference that the universe is scal- 
able. If less than ten items are used, it may not be too safe to infer 
that the universe is scalable if all the items must be dichotomized 
in order to obtain high reproducibility; at least some should be 
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retainable in trichotomous form to make the inference plausible. 
It may well happen that a sample of items from a nonscalable uni- 
verse might yield high reproducibility when dichotomized, if the 
sample is not very large. Of course, once scalability is established, 
then any further combinations can be made to reduce the number 
of scale types to be used in practice, if that is desired. In the actual 
test for scalability, however, it is best to keep as many answer cate- 
gories as possible. 

3. Range of marginal frequencies. With respect to the criterion 
of item marginals, it is essential to remember that reproducibility 
may be artificially high because the items have extreme kinds of 
frequencies. To avoid reproducibility being artificially high, some 
dichotomies should have marginals close to 50-50. Similarly, for 
trichotomies and other multicategoried questions, the great bulk of ; 
the respondents should not be in any one category (after combina 
tion), except possibly for an occasional question. The present ex 
ample with its wide range of marginal frequencies, half of which lie 
between 40 and 60, satisfies this criterion very well. 

4. Pattern of error. The pattern of error should be inspected to 
see that there are no substantial nonscale types of persons. Solid 
segments in a column that fall outside the cutting points indicate 
the presence of definite additional major factors in the responses, so 
that more than one appreciable dimension is present. Nonscale 
types can be recognized from the occurrence of the same kind of 
error for a large (i.e., five or more) number of respondents. None 
are obvious in the present case. 

It is also important to remember that scale analysis should not 
be depended upon to determine content. An item of differing con- | 
tent may fit into the scale pattern of an area, while items with 
homogeneous content need not scale. Scale analysis can provide 
some auxiliary aid in interpreting content especially in pretests, but 
it is important to keep in mind the cautions discussed previously. 

From the scalogram picture for Step 4 it is now possible to deter- 
mine how the entire population should be scored. As discussed 
previously, a simple weighting scheme is sufficient. Categories of 
dichotomized items are weighted 1 and 0; categories of trichotomized 
items are weighted 2, 1, and 0;etc. A more rigorous method would 
involve scoring individuals according to the ideal scale type which 
fits with the least amount of error, rather than by weighting cate- 
gories. For example, individual 69 (rank order 6) would be scored 
13 according to closest ideal scale type, but 12 according to weights. 
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To score this individual in ideal scale type 13 (positive answers to 
all twelve questions) would result in only one error (his negative 
response 27-2). However, scoring him as 12 (the number of posi- 
tive responses made) results in his deviation from the ideal scale 
type 12 on two counts—his positive response 35-1 and his negative 
response 27-2 (see Figure 8). A more extreme example would be 
individual 20 (rank order 91) who fits into ideal scale type 0 accord- 
ing to the least error criterion (two errors of reproducibility, 32-1 
and 29-1), but who would receive a weighted score of 2 (increasing 
errors of reproducibility from ideal scale type 2 to four errors— 
32-1, 29-1, plus 31-3, 4, and 36-3, 4,5). While scoring according to 
closest ideal scale type is desirable, it is a very slow process. The 
correlation between the two methods is so high as to indicate the 
acceptability of the weighting method for practical purposes. 

A simple scoring diagram for a dichotomous scale when IBM 
card-sorting equipment is available would take the following form, 
as illustrated in Figure 9 for the first five dichotomies from the 
present example. 


First Sort —col. 35 
Second Sort—col. 37 
Third Sort —col. 34 


Fourth Sort—col. 28 


Fifth Sort —col. 26 


Scale Score § 4 8 2 1 0 
Figure 9. Scoring diagram for dichotomies. 


Scaling with only one board. With experience in recognizing the 
parallelogram pattern and errors of reproducibility, the entire 
scalogram procedure can be carried out on one board only. This 
has an important economic advantage, since the cost of construct- 
ing two perfectly matched boards can be eliminated. The con- 
struction of one board alone composed simply of one hundred re- 
movable slats with one hundred holes? in each slat can be done very 


» *Less than 100 or more than 100 holes can be used, since the single board need no 
longer be square. Also since no transfer of balls from board to board is necessary, the 
holes need not be drilled so exactly. 
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cheaply. With a grasp of the principles and methods involved in___ 
the use of two boards, the one-board method becomes quite feasible. 

This method consists of Steps 1 and 2 exactly as outlined. How- 
ever, Step 3, requiring the combination of answer categories and 
the reordering of questions, is also carried out on Board 1. This 
is done by inspection of each question separately as in Step 3, except 
that the different answer categories of each question remain in their 
original position instead of being juxtaposed. Thus the combina- . 
tions for question 36, for example, are decided upon by looking at 
columns 3, 20, 24, 33, and 42 in Step 2 and computing the error as 
outlined in Step 3. The combination decided upon is written down 
for each question, the new frequencies of ‘‘positive’”’ and ‘“‘negative”’ 
are computed, the board is cleared, and Steps 1 and 2 are repeated, 
this time using the new question frequency order. Step 4 is then 
completed exactly as outlined. Step 3 is the only step involving 
the use of a second board. In the case of dichotomous items where 
there is no problem of combining answer categories, one board is 
equally as satisfactory as two. This method using only one board, 
while not as manipulatable as the two-board method, has been used 
very satisfactorily in practice. 

The amount of work. The scalogram for Step 4 shows the final 
multivariate pattern of responses. Some of the various functions 
it can serve will be discussed in the following chapter. Other uses 
will probably be forthcoming with additional research. It cannot 
be emphasized too strongly that this technique is still young, so 
that all of its potential weaknesses and strengths are as yet far from 
known. Undoubtedly many of the specific procedures outlined 
above can and will be improved upon. The best claim that can be 
made for the present procedure is that it has worked and proved its 
utility over four years of attitude and opinion research in the Army. 
In practice it has been found that a clerk can be trained fairly easily 
to carry through from Step 1 to Step 4 in about eight hours. 


CHAPTER 5 


THE UTILITY OF SCALOGRAM 
ANALYSIS' 


T Is important to remember that, in many cases, the present re- 
I search represents only first approximations to perfect scales. 
Once an area was found to have sufficient reproducibility, little 
attempt was made to study it more carefully by means of addi- 
tional pretesting. In fact, in many cases it was often necessary, 
due to the press of wartime conditions, to treat an area as if it were 
scalable even though the four criteria of scalability were met only 
imperfectly. It was also found necessary in many instances to 
omit questions from an area because of low coefficients of improve- 
ment, without making an additional pretest to find out what extra 
factors were involved in the questions. The following examples 
are therefore included as illustrations of a technique rather than 
for their specific content value. 

The scale pictures, or scalograms, on the following pages present 
briefly several actual examples of scale analysis utilizing the scalo- 
gram board technique. Each scalogram will list the questions used 
and the combinations made in the answer categories. The extent 
to which the four criteria of scalability are met will be indicated in 
each case. Following each scalogram will be a brief comment on 
some of the uses which were made of this scalogram. A detailed 
discussion of the utility of scale analysis in general will be given in 
the second section of this chapter. 


1 By Edward A. Suchman. 
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SECTION I 


EXAMPLES OF SCALOGRAM ANALYSIS 
The following scalograms are presented for discussion: 


Scalogram Universe 
Lica cdeseoieenss Satisfaction with One’s Army Job 
De ct Nor te a oar a Attitude toward Officers 
>, EOE SORE Attitude toward the WAC 
Be dre aaieatereh 9% Attitude toward Postwar Conscription 
Da akewiceee aah ae Attitude toward an Army Career 
Ois.5- OeCansdlig wanes Knowledge of Current Events 
f eee Pern ree Fear Symptoms 


1. Job satisfaction. The content of this scale, as can be seen from 
the specific questions asked, concerns the extent to which the soldier 
is satisfied with his job in the Army. The sample consists of nine 
dichotomized questions and has a coefficient of reproducibility of 
.92. Errors appear to be largely random, with no sizable nonscale 
types. The range of positive frequencies for the different ques- 
tions is wide, running from 19 per cent to 78 per cent. One ques- 
tion (question 11, columns 6 and 16), “Do you usually feel that 
what you are doing in the Army is worthwhile or not?” shows al- 
most perfect reproducibility (only 1 error in rank 86). Respondents 
in ranks from 1 to 59 reply, “I usually feel it is worthwhile,” while 
respondents in ranks 60 to 100 reply, “T usually feel it is not worth- 
while’’ or ‘‘Undecided.”’ This single question, therefore, can serve 
as a good approximation to the entire scale if a simple division of 
the population into two ranks is desired with about 60 per cent in 
the upper rank and 40 per cent in the lower rank. If it is desired 
to split the population into some ratio different from 60:40, the 
question which has the marginal frequency with the desired ratio 
is the one to use, even though its reproducibility may not be per- 
fect, if a single question is to be used at all. In the case of imperfect 
reproducibility it is, of course, desirable to use more than one ques- 
tion, even if only a dichotomization of the population is wanted. 

2. Altitude toward officers. This scalogram deals with the enlisted 
man’s opinion of his officers. The sample of items consists of 
eleven questions, all dichotomized. The scalogram has a coeffi- 
cient of reproducibility of .90. The range of marginal frequen- 
cies runs from 8 per cent favorable to 79 per cent favorable, and 
errors are random. In addition to these eleven questions, the re- 


SATISFACTION wits Ong’s Army Jos (Coefficient of Reproducibility 


ScaLocRaM No. 1. 


= .92) 


Category order 


ofa esteem 
0°s‘9*s*a-or 


tesase7 a pio 121316 16 16.07 18 139081 OT £9 £44 90 TT CE HO9DTL FF $F 5695 FC ST OE SD 60 41 62.69 04 45 4447 98 0950 


owe On 09 ES SEEN OL REE LAMM RRR LIDS F9 T¥ PRARDERRRBRS IIIS SB SSSREAOLLL HALA TS LES SORARAELRTBKY 


4, 


2 


10. 


UTILITY OF SCALOGRAM ANALYSIS 125 
Scalogram No. 1. Satisfaction with One’e Army Job 
Questions and Answer Categories 


How do you feel about the importance of the work you are doing now a8 compared 
with other jobs you might be doing in the Army? 


1 It is as important as any other job I could do 
2 It is fairly important, but I could do more important work 
3 It hardly seems important at all 


. How interested are you in your Army job? 


1 Very much interested 
2 A little, but not much 
3 Not interested at all 


How satisfied are you about being in your present Army job instead of some other 
Army job? 


1 Very satisfied 
2 _____ Satisfied 
3 It does not make any difterence to me 
4 Dissatisfied 
5 Very dissatisfied 
. Would you change to some other Army job if given a chance? 
1 Yes 
2 No 
3 Undecided 
. Do you feel that everything possible has been done to place you in the Army job 
where you best fit? 
1 Yes 
2 No 
3 Undecided 


Do you consider your own present job or duty in the Army an important one in the 
war effort? 


1 Very important 

2 Pretty important 

3 Not so important 

4 Not important at all 
5 Undecided 


On the whole, do you think the Army is giving you a chance to show what you 
can do? 


1 A very good chance 

2 A fairly good chance 
3 _____ Not much of a chance 
4 No chance at all 

5 Undecided 


11. Do you usually feel that what you are doing in the Army is worthwhile or not? 


12. 


1 I usually feel that tt 1s noé worthwhile 
2 I usually feel zt ts worthwhile 
3 Undecided 


Which of the following would you say best applies to your job? 


1 Time always passes quickly 

2 ——— Time passes quickly most of the time 

3 Enjoy working part of the time, but it drags at other times 
4 Time drags most of the time 

5 Time always drags 


“No answers” are all coded 0. 
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Scalogram No. 2, Attitude toward Officers 
Questions and Answer Categories 


How do you feel about the privileges that officers get compared with those which 
enlisted men get? 


1 Officers have far too many privileges 

2 Officers have a few too many privileges 

3 Officers have about the right number of privileges 
4 Officers have too few privileges 


Did your officers give you a good chance to ask questions as to the reason why things 
were done the way they were? 


1 Yes, always 

2 Yes, usually 

3 Undecided 

4 No, not very often 

5 No, almost never 
How many of your officers took a personal interest in their men? 

1 All of them 

2 Most of them 

3 ______ About half of them 

4 Few of them 

5 None of them 


Do you think that your officers generally did what they could to help you? 


1 Yes, all the time 

2_____ Yes, most of the time 

3 No, they often did not 

4 No, they almost never did 


How weil do you feel that your officers understood your problems and needs? 


1 They were very much aware of my problems and needs 
2 They were fairly well aware of my problems and needs 
3 They did not know very much about my real problems and needs 


Do you feel that your officers recognized your abilities and what you were able to do? 


1 Yes, I’m sure they did 
2 Yes, I think they did, but I’m not sure 
3 No, I don’t think they did 
4 Undecided 
In general, how good would you say your officers were? 
1 Very good 
2______ Fairly good 
3 About average 
4 Pretty poor 
5 Very poor 
How many of your officers used their rank in ways that seemed unnecessary to you? 
1 Almost all of them 
2 ____ Moat of them 
3 _____ Some of them 
4 Only a few of them 
5 None of them 


128 UTILITY OF SCALOGRAM ANALYSIS 


19. When you did a particularly good job did you usually get recognition or praise for 
it from your officers? 


1 Always 
2 Usually 
3 Rarely 
4 Never 
20. How much did you personally like your officers? 
1 Very much 
2 Pretty much 
3 Not so much 
4 Not at all 


21. How did you feel about the officers that had been selected by the Army? 
1 There were the best ones that could have been selected 


2 They were as good as any that could have been picked 
3 ____ Somewhat better ones could have been picked 
4______. Much better ones could have been picked 
5 Undecided 
22. How much did you personally respect your officers? 
1 Very much 
2 Pretty much 
3 Not so much 
4 Not at all 


23. On the basis of your Army experience, do you think relations between officers and 
enlisted men were satisfactory or unsatisfactory? 


1 Very satisfactory 

2 _____. Fairly satisfactory 

3 Undecided 

4 Fairly unsatisfactory 
5 Very unsatisfactory 


24, When you are discharged from the Army, do you think you will go back to civilian 
life with a favorable or unfavorable attitude toward the officers in the Army? 


1 Very favorable 

2 Fairly favorable 

3 _____ About 50-50 

4 Fairly unfavorable 
5 Very unfavorable 


sponses to three other questions are given on the side of the scalo- 
gram. Two questions, numbers 15 and 16, are offered as examples 
of items which did not conform to the scale pattern. They were 
originally included as items characterizing a soldier’s attitude 
toward his officers, but, as can be seen from the scalogram picture, 
contain a large number of errors. One cannot predict well enough 
from a knowledge of the respondent’s scale score, based upon his 
answers to the other eleven reproducible items, what his responses 
to these two questions would be. 

The next step in a complete scale analysis would be to pretest 
these two questions further in an attempt to find out what addi- 
tional variables they contain. By means of such a pretest it may 
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be possible to break down the universe of attitude toward officers 
into further subuniverses. Or it may be decided that the additional 
variables are irrelevant and one can proceed with the eleven items 
that were scalable. It should not be surprising that acceptable 
scales may not be forthcoming from the first analysis of a series of 
items. Most social attitudes are complex and it is to be expected 
that they contain more than one variable. 

The third question on the side, number 11, was not intended as 
part of the universe of attitude toward officers, but was included in 
the scalogram picture merely to see how closely it correlated with 
the attitude items. If it did happen to conform to the scale pat- 
tern, this might mean that the universe to which it belonged had a 
close relationship with attitude toward officers. However, the 
present scalogram alone is not sufficient to establish the size of the 
correlation; additional items similar in content to the one in ques- 
tion would have to be constructed and also tested along with the 
present series. It should especially be remembered that even if an 
item is reproducible perfectly from scale scores, this is not proof 
that the item is part of the definition of the universe. No matter 
how item 11 correlated with attitude toward officers, we would not 
on the basis of this correlation consider it as part of that universe. 
Only a judgment of content can determine what belongs in a uni- 
verse, and not correlations or reproducibility. This problem of 
determining the items of a universe will be discussed in greater de- 
tail in the next section. 

3. Altitude toward the WAC. The scalogram picture of attitudes 
of enlisted men toward the Women’s Army Corps is shown in 
Scalogram 3. The sample is composed of nine dichotomized ques- 
tions dealing with various aspects of the WAC and has a coefhi- 
cient of reproducibility of .89. Marginal frequencies are distributed 
evenly over a range from 12 per cent “favorable” to 64 per cent 
‘“favorable.”” Question 35 (columns 3 and 13) is of low reproduci- 
bility. Column 3 shows 12 errors and 17 nonerrors. In addition, 
there is a sizable number of errors in the ‘‘positive’’ answer category 
(column 3) which cluster together in the low ranks, indicating that 
there is a group of men with relatively “unfavorable” attitudes 
toward the WAC who nevertheless answer “favorably” on this 
question. This shows that the errors, in addition to being large in 
number, are not random. What possible interpretation is there of 
this nonscale question? 

One hypothesis, which requires further testing before it can be ac- 
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cepted, is that the second variable which this question introduces 
into the scale is one of factual information about the WAC, and not 
of attitude or opinion. The question asks, ‘“Do you agree or dis- 
agree with the following statement, ‘WAC’s get ratings a lot easier 
than men do?’” In general men who are relatively favorable to 
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32. 
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Scalogram No. 8. Attitude toward the WAC 
Questions and Answer Categories 


In your opinion, how necessary is it for the war effort to have WAC’s in the Army? 
1 Very necessary 
2 _____. Pretty necessary 
3 Not so necessary 
4 Not necessary at all 
5 Undecided 


Suppose a girl friend of yours was considering joining the WAC. Would you ad- 
vise her to join or not to join? 

1 I would advise her to join 

2 I would advise her not to join 

3 Undecided 


. In your opinion are the jobs which women in the WAC do more important or less 


important than the jobs which are done by men in the Army who are not on combat 
duty? Or are they equally important? 
1 The WAC does more important jobs 


2 The WAC does less important jobs 
3 The jobs are equally important 
. A woman does more for her country in the WAC than she can do by working in a 

war industry. 
1 Agree 
2 Disagree 
3 Undecided 

The training a woman gets in the WAC will be useful in civilian life. 
1 Agree 
2 : 
3 Undecided 

Being a WAC is bad for a girl’s reputation. 
1 Agree 
2 Disagree 

The WAC is no place for a girl. 
1 Agree 
2 Disagree 

WAC officers deserve a salute just the same as men officers. 
1 Agree 
2 Disagree 

. WAC’s get ratings a lot easier than men do. 

1 Agree 
2 Disagree 


“No answers” are all coded 0. 
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the WAC on other questions are inclined to disagree with this state- 
ment, showing that this item, to a certain extent, does involve the 
variable “attitude toward the WAC.” However, many men who 
in general are among the least favorable to the WAC are also in- 
clined to disagree with this statement despite their generally “un- 
favorable” attitude. It may be that these men are answering the 
question based on their knowledge that, as a matter of fact, WAC’s 
do not get ratings a lot easier than men do. This hypothesis can 
only be checked by additional pretesting. 

Another interesting example of the use of scalogram analysis in 
pretesting the meaning of questions is also given by this scale. In 
a pretest form, question 27 originally read, “In your opinion how 
necessary is it for the war effort to have women in the Army?” 
as compared to the present wording which substitutes the word 
“WAC’s” for “women.” It was found, upon testing responses 
for scalability, that this question as originally worded contained a 
great deal of error. Many individuals who occupied low ranks on 
the scale answered that it was necessary to have “women” in the 
Army. Further pretesting revealed the reason for this discrepancy. 
These individuals were interpreting the ‘‘women”’ to include nurses 
rather than only WAC’s. Substituting the word “WAC’s” for 
“‘women,’”’ removed this source of error as can be seen from column 
4 of the present scalogram picture. 

4, Altitude toward postwar conscription. This is only a five-ques- 
tion scale, but it contains two trichotomies, which help to reduce the 
possibility that this sample of five questions could produce the scale 
pattern and yet not belong to a scalable universe. Reproducibility 
is .94, errors are random, and the range of marginals is diverse. 

The two trichotomies in this scale illustrate quite clearly what is 
implied by a separate middle response category. For example, the 
first trichotomy, based on question 60, divides the population into 
three ranks with only seven errors of reproducibility. In 93 cases 
out of 100, we could predict correctly how the respondent answered 
this question on the basis of the respondent’s rank order. This 
single question could be used alone to divide the population into 
three rank groups with regard to the issue of universal military 
training. The fact that this question was selected from the scalo- 
gram pattern gives one increased assurance that, even if a different 
question were asked from the indefinite number of opinion ques- 
tions that could be asked on this issue, the rank ordering of indi- 
viduals would be essentially the same as on this question. 
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It is also instructive to note what these two trichotomies con- 
tribute to the scale continuum in the way of differentiating scale 
types. These scale types can be marked off according to the pro- 
cedure previously outlined. The number of scale types would be 
decreased from eight to six if these trichotomies were to be dichot- 
omized; combining the middle category of question 60 with the top 
category (column 6 with column 2) would combine scale types 5 
and 6, while combining the middle category with the bottom cate- 
gory (column 6 with column 11) would eliminate the distinction 
between scale types 1 and 2. In a similar way for question 61, 
combining columns 3 and 7 would combine scale types 4 and 5, 
while combining columns 7 and 12 would combine scale types 0 
and 1. As has been pointed out, combining answer categories does 
not affect the rank order of individuals, but it does cut down on the 
number of ranks between which the scale discriminates. The num- 
ber of ranks into which the scale continuum can be divided is a 
direct function of the diversity of marginal frequencies of the items 
used. 

It is worth noting that the combination of answer categories may 
be quite different from that expected on the basis of manifest con- 
tent. In both of the present examples of trichotomized items, the 
following combinations were indicated: 


“Positive”... sii: “Strongly agree”’ 
“Neutral” ......... “Agree” and “Undecided” 
“Negative” ......... “Disagree,” “Strongly disagree,” 


and “No answer.” 


It should be remembered that to a certain degree these combi- 
nations are arbitrary, and that where it is desired to base the com- 
binations solely upon the scale pattern, it is advisable to increase 
the working sample of respondents from 100 to 200 cases, using two 
boards. 

5. Attitude toward an Army career. This scale on the desire of 
enlisted men to remain in the Army after the war, while based on 
only six dichotomized items, seems almost perfect; the sample shows 
the extremely high reproducibility coefficient of .98. The six items 
have marginal frequencies ranging from 12 per cent “favorable” to 
59 per cent “favorable.” This scale was used in a problem of pre- 
dicting which men would be most likely to want to stay in the 
Regular Army after the war. While the prediction of how many 
requires more than just the scale data, the presence of a scale pat- 
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tern enables the research worker to know that a single variable 
underlies the series of predictive questions asked. To a certain 
extent, the high reproducibility of this scale is probably due to the 
emphasis each of the questions places upon the conditions under 
which the respondent could be induced to remain in the Army. Since 
these conditions may be thought of as cumulatively more favorable 
to reenlistment, the resultant scale pattern is a logical conclusion. 
In this connection it is worth noting that the single question, ‘“‘Re- 
gardless of what you want to do, do you think you will actually 


Scalogram No. 4. Altitude toward Postwar Conscription 
Questions and Answer Categories 


57. After the war, do you think the United States could or could not have an Army 
sufficient for the country’s needs by taking volunteers only? 
6 Yes, I’m almost sure it could 
7 Yes, I think it could, but I’m not sure 
8 ______ No, it probably could not 
9 No, I’m sure it could not 


58. After the war, do you think the United States should draft all young men for a cer- 
tain amount of Army training or should we go back to the regular Army system of 
volunteers only? 


6 Draft all young men for a certain amount of training 
7 _____ Go back to taking volunteers only 
8 Undecided 


59. If you had a son, would you want him to have a certain amount of Army training 
during peacetime, after this war, or not? 


6 Yes, I’m almost sure I would 
7 Yes, I think I would, but I’m not sure 
8 No, I probably would not 
9 No, I’m sure I would not 
11 Undecided 


60. Do you agree or disagree with the statement? 
“Even if all other countries agree not to have large armies, the United States should 
still draft all young men for military training.” 


6 Strongly agree 
7 ——. Agree 
8 Undecided 
9 Disagree 
ll Strongly disagree 


61. Do you agree or disagree with the statement? 
“The best way to protect the United States against another war is to make her so 
strong militarily that no one would dare to attack her.” 


6 Strongly agree 
7 _-__. Agree 
8 Undecided 


Disagree 
11 Strongly disagree 
“No answers” are all coded 12. 
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stay in the Army after the war?” fell out of the scale pattern (ques- 
tion 37—column 30, with ten errors and eight nonerrors). This 
suggests that while the universe being studied is one of desire 
to stay in the Army (which was the content used in the construc- 
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Scalogram No. 6. Attitude toward an Army Career (Enlisted Men) 
Questions and Answer Categories 


Are there any conditions you can think of under which you might consider staying 
in the Army after the war? 
1 I will probably stay in the Army after the war 
2 Under certain conditions I might consider staying in the Army 
3 I wouldn't consider it under any condition 
Do you think that you might want to stay in the Army for a career after the war? 
1 Yes, I would want to very much 


2 Yes, I might want to, but I’m not sure 
3 No, I don’t think I’d want to 
4_____ No, I would not want to at all 

5 Undecided 


Do you think that you might want to stay in the Army for a year or two after the 
war? 


1 Yes, I would want to very much 


2 Yes, I might want to, but I’m not sure 
3 No, I don’t think I’d want to 

4 No, I would not want to at all 

5 Undecided 


If we have a large Army after the war (say 2,000,000 men) do you think you might 
want to stay? 


1 Yes, I'd like to stay on if we have a large Army 


2 _—___ I might want to stay if we have a large Army, but I’m not sure about it 
3 A large Army would make no difference. I wouldn’t stay on anyway 
4 Undecided 


If civilian jobs are hard to get after the war, do you think you might want to stay 
in the Army? 


1 I would want to stay in the Army 


2 I would want to get out to look for a job, but I might want to come back 
to the Army 

3 I would rather take any kind of job outside than stay in the Army 

4 Undecided 


Do you think you might want to stay in the Army after the war if you could change 
your Army job? 


1 I would want to stay, whether I could change my job or not 


2 I might want to stay if I could change my Army job 
3 I wouldn’t want to stay, no matter what Army job was offered me 
4 Undecided 


Regardless of what you want to do, do you think you will actually stay in the Army 
after the war? 


1 Yes 
2 No 
3 Undecided 


‘‘No answers” are all coded 0. 


ScaLoeraM No. 6. KNOWLEDGE OF CURRENT Events (Coefficient of Reproducibility 
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tion of the sample of items), there are individuals who planned to 
stay in although they wanted to get out, and vice versa. The “‘de- 
sire” to stay in the Army and the “‘plan”’ to stay in represent two 
separate, although related, areas. The present scale represents a 
ranking of enlisted men according to their desire to stay in the 
Army; how this desire relates to actual reenlistment is a separate, 
although extremely interesting, problem. The internal validity of 
a scale which purports only to rank men on a continuum of desire to 


41. 


43. 


46. 


Scalogram No. 6. Knowledge of Current Events 
Questions and Answer Categories 


. Who is the Chief of Staff of the U.S. Army? 


6 General MacArthur 
7 General Eisenhower 
8 General Arnold 
9 General Marshall 

Which of the following countries is at war against the Axis? 
6 Argentina 
7 Sweden 
8 Brazil 
9 Turkey 

. Where was the first major beach head established by the Allies in their invasion of 

Europe? 
6 Southern France 
7 Normandy 
8 Holland 
9 Belgium 

Who is the recently appointed Secretary of State? 
6 Welles 
7 Stettinius 
8 Nelson 
9 Byrnes 

. The man who said we would have “peace in our time” was 

6 Churchill 
7 Mussolini 
8 Chamberlain 
9 Tanaka 


. The ‘“‘Co-Prosperity Sphere” was a scheme of 


6 The Germans 

7 The Spanish 

8 ______ The Italians 

9 The Japanese 
“Fascism” began in 

6 Treland 

7 Russia 

8 Italy 

9 - Germany 


“No answers’ are all coded 12. 


Category order 


Scanocram No.7. Fear Symptoms (Coefficient of Reproducibility = .92) 
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do something cannot be judged by an external criterion of behavior 
which reflects many other variables besides the desire. 

6. Knowledge of current events. Information items can also form 
scales. When they do, a score on such a scale has a definite mean- 
ing. If answers are scored as “right’’ or ‘‘wrong,”’ a person with a 
higher score must know everything that a person with a lower score 
knows and something additional. Ranking individuals in order of 
knowledge can be done with the assurance that additional questions - 
of knowledge on the same topic will always produce essentially the 
same rank order for the people. 

The present example is presented for illustrative purposes only. 
The fact that only seven items were used, each of which was dichot- 
omized, plus the concentration of four of these items at 70 per 
cent correct or above, does not indicate high reliability. 

Scalogram 6 shows the pattern obtained from seven questions 
dealing with current events. All items are dichotomized into 
“right” or “wrong,” and the coefficient of reproducibility is .93. 
From this sample we see that, within the limits of error, if a person 
knows that “the ‘Co-Prosperity Sphere’ was a scheme of the Japa- 
nese,” he will know the answers to all the other questions. The 
scalogram can help one decide whether an information question is 
really getting at attitudes rather than information—or, stated an- 
other way, how much affective tone is involved in what appears to 


Scalogram No.7. Fear Symptoms 
Questions and Answer Categories 


Soldiers who have been under fire report different physical reactions to the dangers of 
batile. Some of these are given in the following list. How often have you had these 
reactions when you were under fire? Check one answer after each of the reactions listed 
to show how often you had the reaction. Please do it carefully. 


Violent pounding of the heart 


25 Sinking feeling of the stomach 
26 Feeling of weakness or feeling faint 
27 Feeling sick at the stomach 
29 ______ Cold sweat 
30 Vomiting 
31 Shaking or trembling all over 
32 _____ Urinating in pants 
34____ Losing control of the bowels 
35 Feeling of stiffness 
The answer categories for each of the above were: 
5 Often 
6 Sometimes 
7 Once 
8 Never 
12 No answer 
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be a factual statement. It is to be expected that if an attitude item 
were included in this series of information questions, individuals 
with a “favorable” attitude would occur frequently enough among 
both the well and badly informed so as not to satisfy the rather rig- 
orous requirements of reproducibility. The same would be true of 
an information item included in a series of attitude items. 

7. Fear symptoms. An interesting application of scale analysis 
is given by the scalogram of the reports of combat veterans on ten 
different physiological fear symptoms. All but one of the symp- 
toms, “Cold sweat” (columns 41, 42, 43) fit into a scale pattern. 
In almost all cases the answer categories ‘Often’ and ‘“‘Sometimes”’ 
are combined to signify a ‘“‘positive’’ report on the occurrence of the 
symptoms, while ‘Once’ and “Never” are ‘‘negative.”’ Nine dichot- 
omized items result in a coefficient of reproducibility of .92, and the 
frequencies range from 9 per cent who reported ‘urinating in pants”’ 
to 84 per cent who experienced ‘‘violent pounding of heart.’”’ The 
rank order of the dichotomized symptoms permits one to predict, 
for example, that if a man experienced “shaking or trembling all 
over’ (column 7) he must also have experienced “sinking feeling of 
the stomach” (column 8) and “violent pounding of the heart” 
(column 9). 

In other words, the symptoms come from a single universe and 
permit a rank ordering of respondents along a single continuum. 
There is an intrinsic interdependence among the different fear symp- 
toms which permits them to be ordered from more to less severe. 
In this case the underlying continuum is probably physiological; in 
the case of attitudes, the ability to rank items from more to less 
severe is probably due to the similarity of cultural influences for the 
population studied. 


4 


* * * * * 


Many different methodological uses of scalogram analysis have 
been illustrated in the above discussion of actual scale pictures. 
Many more examples could be given for each of the applications 
discussed; other examples could be given to show additional appli- 
cations. It can be expected that further research will serve to dis- 
cover even more applications. 

The following section will now present a general summary of the 
utility of the scalogram as a means of scale analysis and point out 
some of the advantages and ramifications of scale analysis. 
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SECTION II 


THE UTILITY AND RAMIFICATIONS OF SCALOGRAM ANALYSIS 


The previous section has presented several examples of scale 
analyses made by means of the scalogram board. These examples 
were discussed from two points of view: (1) how did the scalogram 
board help to determine whether or not a scale existed, and (2) how 
did the knowledge of whether or not the area being studied was 
scalable help in the analysis of the problem. The first aspect deals 
with the research advantages of the visual multicorrelational anal- 
ysis permitted by the mechanical device called the scalogram board, 
while the second aspect deals with the much broader problem of the 
research advantages of knowing that an area is scalable, this is, that 
the series of items composing the scale permit a rank order of indi- 
viduals along a single continuum. 

It is important to keep clearly in mind the distinction between the 
advantages of a test for scalability of items, such as the scalogram 
board affords, and the subsequent advantages of the use of scales 
for research, if the test shows that a scale exists. The question, 
“How does a scale analysis as carried out on the scalogram board 
(or by the Cornel] technique or by some other equivalent procedure) 
help the research worker in his analysis of a series of items?” is quite 
different from the question, ‘‘How does the fact that a series of items 
forms a scale help the research worker in the use he makes of such a 
series of items?” In this section we shall first present an answer 
to the first question, while an answer to the second question will be 
given in a later part. The final section of this chapter will present 
some of the further ramifications which the problem of scaling holds 
for future attitude research. 


Utility of the Scalogram Board 


[ The scalogram board derives its utility from the fact that the basic 
theory of the present concept of scale analysis calls for a simple, 
although highly restrictive, pattern of interrelationship between the 
items being tested. This pattern, as was discussed in detail in 
Chapter 3, requires that only one possible combination of item 
responses characterize each scale rank, and furthermore that each 
scale rank differ from the scale rank above and below it by only 
one item response. | Thus the number of types observed in a scal- 


tS 
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able area is greatly reduced from the total number of possible types. 
For example, a scalable area of ten dichotomous questions would 
call for only eleven scale types out of a possible 1,024 types. [it is 
this highly restrictive nature of scalogram analysis which permits 
the visual analysis of the multivariate frequency distribution of re- 
sponses of a large number of items for the scale pattern.|\ 

The parallelogram pattern described in Chapter 3 is one means of 
representing the item interrelationships in a scalable area. By ar- 
ranging individuals in descending order of “‘favorableness” and by 
arranging item responses in ascending order of frequency of “‘favor- 
able” responses, a scalable series of items will result in a parallelo- 
gram pattern. If a device could be constructed which permitted 
both the ordering of individuals and the ordering of items, then it 
should be possible to test a series of items very quickly for the re- 
quired parallelogram pattern of a scale. The scalogram boards are 
just_such a device. 

The scalogram boards permit the research worker to shift the 
rank order of individuals while the order of the items remains con- 
stant, and vice versa, enabling him to determine visually whether 
or not the desired parallelogram pattern is forthcoming. \ Without 
such a visual aid, the number of multiple correlations that would 
have to be computed would be staggering. By means of the scalo- 


gram board, the research worker can see just how the various items 


relate to each other and to the pattern as a whole. 

The Cornell technique and other equivalent procedures referred 
to at the beginning of Chapter 4 provide the same picture without 
using actual boards. Although less flexible, the alternative tech- 
niques also result in the visual scalogram desired. 

The helpfulness of a scalogram analysis lies in two directions: 
(1) the type of general pattern that is observed can help in the fur- 
ther definition of the problem, while (2) the specific picture of the 
individual item and its answer categories can help in the construc- 
tion of items. ) Examples of these two uses of scalogram analysis 
are given below. 


1. Scalogram Analysis and the Definition of the Problem 


It is often found during a preliminary scalogram analysis that a 
series of questions that were thought to contain only a single vari- 
able actually do not scale, but that if the series is broken down into 
subseries, these subseries do form scales. The researcher can now 
reexamine his original hypothesis and perhaps reformulate his prob- 


UTILITY OF SCALOGRAM ANALYSIS 145 


lem in terms of multiple variables instead of a single common vari- 
able. Once he has redefined his problem in terms of subuniverses 
of items, he is in a position to construct new items characterizing 
the new subuniverses and to test his revised hypothesis. It is im- 
portant to remember that scalogram analysis itself cannot define 
one’s problem; it can only serve to suggest alternative definitions , 
which then must be tested in their own right. 

A second use of scalogram analysis during the definition stage of a 
problem is to indicate the existence of nonscale types. Many re- 
search problems involve comparisons of different types of groups, 
groups which differ in kind, rather than degree. The preliminary 
scalogram can often show quite clearly when the problem is one of 
differences in kind rather than degree. Furthermore, it can often 
point out just what the main types are, their characteristics and 
relative frequency. On the basis of this information, the researcher 
may decide whether or not he has to rethink his problem in terms 
of a typological analysis rather than one involving rank orders. 

The following two examples will illustrate the utility of a scalo- 
gram analysis for the definition of a problem. 

a. Attitudes toward ‘‘sweating out a jump’’: nonscale types. Scalo- 
gram 8 shows a great deal of error (coefficient of reproducibility 
= .77); therefore we cannot conclude that there is a single variable 
in the area of ‘‘worries in anticipation of a parachute jump.” The 
differences among individuals appear to be differences in kind of 
worry, rather than degree of worry. Soldiers may be characterized 
by the kinds of things they are afraid of in making a parachute 
jump, rather than the degree to which they are afraid. A further 
analysis of this problem might take the form of a typological com- 
parison. 

Contrast this result with that obtained previously with the uni- 
verse of fear symptoms among combat soldiers (Scalogram 7). In 
that example, symptoms were found to scale and we could speak of 
the degree of fear present. In the present series of items we cannot 
speak of the relative amount of fear shown by paratroopers, but of 
the different kinds of fears they exhibited. 

b. Attitude toward combat. This example, showing how the exist- 
ence of a nonscale type serves to indicate a possible second variable, 
will illustrate the way in which scalogram analysis may help to break 
down a nonscalable area into scalable subareas. A series of six ques- 
tions dealing with “readiness for combat” appears to form a scale 
whose coefficient of reproducibility is .94 (Scalogram 9). However, 
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Scalogram No. 8. Attitude toward “Sweating Out a Jump” 
Questions and Answer Categories 


Different men say that they sweat out different parts of the jump. Check each one 
of the following list of items to show how much you sweat out each part of the jump. 


During the jump today did you sweat out: 


9. Whether you would get a rough landing? 


1 I sweat out the idea of a rough landing a lot 
2 I sweat out the idea of a rough landing some, but not too much 
3 I did not sweat out the idea of a rough landing much at all 


10. Waiting around in the sweat-shed? 


1 I sweat out waiting around a lot 
2 I sweat out waiting around some, but not too much 
3 I did not sweat out waiting around much at all 


11. Whether you would get a hard opening shock? 


1 I sweat out the idea of a hard opening shock a lot 
2 I sweat out the idea of a hard opening shock some, but not too much 
3 I did not sweat out the idea of a hard opening shock much at all 


12. Whether you would freeze in the door? 


1 I sweat out the idea of freezing in the door a lot 
2 I sweat out the idea of freezing in the door some, but not too much 
3 I did not sweat out the idea of freezing in the door much at all 


13. Whether the chute might fail to open or malfunction? 


1 I sweat out the idea of the chute failing to open a lot 
2 I sweat out the idea of the chute failing to open some, but not too much 
3 I did not sweat out the idea of the chute failing to open much at all 


14. Riding in the airplane? 


1 _____I sweat out the ride in the plane a lot 
2 I sweat out the ride in the plane some, but not too much 
3 I did not sweat out the ride in the plane much at all 


15. The free-fall in the air before the chute opens? 


1 I sweat out the ides of the free-fall a lot 
2 I sweat out the idea of the free-fall some, but not too much 
3 —______ I did not sweat out the idea of the free-fall much at all 


16. Whether you might get tangled up with another man on the way down? 


1 I sweat out the idea of getting tangled up a lot 
2 I sweat out the idea of getting tangled up some, but not too much 
3 I did not sweat out the idea of getting tangled up much at all 
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two nonscale types of five cases each appear in ranks 19 to 23 and 
69 to 73. The errors producing these nonscale types occur in col- 
umns 3 and 10, which contain the answers to questions 15 and 16. 
A cross tabulation of these two questions alone seems to indicate 


16. 


17. 


21. 


22. 


23. 


Scalogram No. 9. Attitude toward Combat 
Questions and Answer Categories 


. Which of the following best tells the way you feel about getting into the actual 


battle sone? 
1 


I want very much to get into it just as soon as possible 

I’m ready to go anytime 

__—— I'd like to go before it’s over, but I don’t think I’m ready yet 

I hope I won’t have to go, but if I do I think I’ll do all right 

I hope I won’t have to go because I don’t think I would do very well 
No opinion 


2 
3 
4 
5 
6 


Do you feel that you are now trained and ready for combat or do you need more 
training? 


1 


2 
3 


I’m ready for combat now 
I need a little more of some kinds of training 
I need a lot more of some kinds of training 


If and when you get into combat how well do you think you will stand up under 
battle conditions? 


1 Very well 

2 Fairly well 

3 Not very well 
4 Not well at all 


Do you think that you are in good physical condition? 


1_____ I feel that way nearly all of the time 
2 I feel that way fairly often 

3 _____ I feel that way only once in a while 
4 I almost never feel that way 


Do you think that you are in tough enough physical condition for going into combat? 


1 Yes 
2 No 
3 Undecided 


Which of the following best describe your own feeling about getting into combat 
against the Germans? 


1 I'd like to get into the fight as soon as I can 
I’m ready to go when my turn comes 

I'd just as soon stay out of combat if possible 
I don’t want to get into combat at all 


2 
3 
4 
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that more than one variable is involved, as can be seen from the fol- 
lowing table: 
Question 16: “Which of the following best 
tells the way you feel about getting into the 
Question 16: “Do you _ actual battle zone?” 
feel you are now trained 
and ready for combat “I want very much to 
or do you need more _ get into it just assoon Other 
training?” as possible” and “I’m replies Total 
ready to go anytime” 


“T’m ready for combat 
now” 33 14 47 


“TI need a little more” 
and “TI need a lot more 
of some kinds of train- 
ing” 14 39 53 


Total 47 53 100 


The fact that we have substantial frequencies in all four cells 
indicates the presence of more than one variable. These two ques- 
tions cannot be scored together if one wishes to rank individuals on 
readiness for combat. Analysis of the content of these two ques- 
tions suggests that we have four types of individuals: (a) those who 
desire combat, and who feel they have had enough training; (b) those 
who desire combat, but who feel they need more training; (c) those 
who do not desire combat, although they feel they have had enough 
training; and (d) those who do not desire combat and who feel they 
need more trainng. The hypothesis for further testing would be 
that the desire for combat and the attitude toward adequacy of 
training are two separate areas each of which may be scalable sep- 
arately. To test this hypothesis, one would have to construct addi- 
tional questions in both areas and test them for scalability. 


2. Scalogram Analysis and Question or Item Wording 


Scalogram analysis provides a powerful test of the internal struc- 
ture of a series of items. In the field of attitude analysis and public 
opinion polling, the selection of items and the determination of their 
relationship to each other is a problem of primary importance. 
Scalogram analysis offers a complete picture of such interrelation- 
ships between items. 

In public opinion polling in particular, the problem of the formu- 
lation of a “valid” opinion question has revolved around the prob- 


UTILITY OF SCALOGRAM ANALYSIS 151 


lem of the meaning of questions. One of the basic concerns of all 
public opinion analysts is the interpretation which the respondent 
places upon the question asked. Rules have been suggested by 
many people on how to word questions. The pitfalls are well 
known, but even the experts are often in disagreement about the 
proper solution. Scalogram analysis does appear to offer some 
help on this problem. Applied during the pretest stages of an opin- 
ion study, scalogram analysis can contribute greatly to the inter- 
pretation of the meaning of an opinion question. 

Some of the many contributions of scalogram analysis to the prob- 
lem of question wording are discussed below. They all concern the 
basic question, ‘“How did the respondent interpret the meaning of 
this question?” This applies to both the question wording and the 
answer alternatives. The important theoretical point involved is 
that any single question asked is but a representative of all other 
questions that might have been asked instead. To determine the 
relation a particular question has to the universe of all opinion ques- 
tions on the same topic, it is necessary to pretest a series of questions 
characterizing the universe. On the basis of the scalogram pattern, 
one can then select the final question to be used. If it is desired to 
divide the population into but two or three groups with respect to 
their attitude, then the one or two questions that have the proper 
marginal frequencies could be used. If the population is to be 
divided into an upper half and lower half, then one or two items 
can suffice to provide this 50-50 split. Or, if it is desired to divide 
the population into thirds, the one or two items that will yield three 
ranks with 334 per cent of the population in each are the ones to be 
selected. Thus, only one or two items can be selected from among 
all characterizing items to be used in the final study to answer ques- 
tions like: How does the more favorable half of the population differ 
from the less favorable half with respect to their age, political party 
affiliation, etc., etc.? It is quite another matter, it should be clear, 
to answer the question: What per cent of the population is ‘‘favor- 
able?” For this, an objective zero point is needed, a problem which 
is studied by the intensity function discussed in Chapter 7. 

Once a question is selected for the final study, the researcher has 
the important assurance that this question has a single meaning to 
all respondents, and that it will rank respondents in the same rela- 
tive order of “favorableness” as any other questions that one might 
choose to ask on the same issue. 

a. Problems of interpretation. Scalogram analysis will help an- 
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swer such problems as: Did different people interpret the question 
differently? Was it double-barreled? Was the subject matter too 
complex or technical to be understood? Did the question contain 
some unforeseen implication? Was the question wording simple 
enough to be understood? Did the interviewer change the original 
meaning of the question? Did the context in which the question 
was asked affect its meaning? etc. All these problems involve the 
determination of how well the question fits into the parallelogram 
pattern, that is, of how many errors of reproducibility there are. 
If a question turns out to be but a simple function of the rank order 
on the entire scale, then we know that this question has a dominant 
single meaning which has not been destroyed by any of the compli- 
cations listed above. The amount of error gives us an objective 
test of the singleness of meaning which the questions have for the 
respondent. To be sure, scalogram analysis will not tell us why a 
question has errors of reproducibility, but it will tell us that there 
is more than a single interpretation present in the responses to the 
question. 

Any change in wording, or in the way the question is asked, which 
changes the meaning of the question by introducing a new and dif- 
ferent variable or factor, can be detected by its effect upon the 
scalogram. The introduction of a stereotype or overtone into a 
question which has the effect of changing the meaning of the ques- 
tion by bringing in some extraneous consideration would be indi- 
cated by an increase in the amount of error. If a question changed 
from a point of opinion to one of information or vice versa, this 
could also be determined. In this way it is possible to test the 
effect of changes in question wording or administration upon the 
meaning which the question has for the respondent. 

b. Problems of answer alternatives. An inseparable part of the 
question asked is the form of the answer categories.? In this re- 
spect we have such problems as: How many alternatives should be 
given? What order should they take? Should they be treated as 
separate categories or combined? If combined, which categories 
should be thrown together? etc. Scalogram analysis will indicate 
very clearly which answer alternatives have similar meanings and 
which differ and by how much. Scalogram analysis will show how 
the answer categories differ in degree. It will show which categories 
should be combined and what the effect of this combination will be 


2 The discussion is restricted here to the check-list type of answers. Scale analysis, 
of course, applies to free answers as well, and indeed to any form of collecting data. 


UTILITY OF SCALOGRAM ANALYSIS roy 
v 


upon the ability of the question to discriminate between scale types. 

The scale pattern of a question shows how the different answer 
alternatives differentiate between the rank order of individuals 
based upon the entire series of questions. If an answer category 
does not differ in degree from the answer category above or below it, 
this will be seen by the inability of this category to select a different 
group of individuals on the rank order from the adjacent categories. 
Similarly ‘no answer’’ and “undecided” categories can be combined 
with other answer categories on the basis of the general rank order 
of the respondents who give these answers. Actual illustrations of 
the use of scalogram analysis in the combination of answer cate- 
gories were given in the previous chapter. In this way it is possible 
to pretest the meaning of answer categories and to secure the best 
combination of alternatives. 


Advantages of Working with Scales 


The above section has pointed out some of the many ways in 
which a scalogram analysis can help in the study of a series of items. 
We have seen how the search for a parallelogram pattern can aid 
the research worker in the task of defining his problem and con- 
structing his items. Now we come to a more basic question, “Of 
what use is it to look for scales?” Stated another way, this ques- 
tion could read, ‘“What are the advantages of working with a series 
of items which can be shown to belong to a scalable universe?” 


The answer to be presented in this section deals with\two types _-~ 


of advantages, mathematical and conceptual. These advantages 
spring from the basic fact that a scalable series of items permits 
the determination of a rank order which is independent of the par- 
ticular sample of items used. The conceptual advantages will be 
discussed below in relation to the problems of description and inter- 
pretation, while the mathematical advantages will be most obvious 
in relation to the problem of prediction. 

In order to show the important analytical advantages of working 
with a series of items which form a scale, as opposed to a series of 
items which may not be scalable, let us look at these three types of 
problems: (1) problems in description, (2) problems in interpreta- 
tion, and (3) problems in prediction. Most research studies usu- 
ally involve at least one of these problems. Analysis by means of 
scales is helpful in each case. 

1. Problems in description. If the problem is one of describing 
an attitude or an opinion, the presence of the scale pattern permits 
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the following conclusions. A person’s score tells what his responses 
were to each question. Persons with the same score show the same 
scale characteristics. A person with a higher score is “favorable” 
on all questions that a person with a lower score answers “favor- 
ably,” and on at least one more in addition. This rank order, fur- 
thermore, exists not only for the given series of questions, but is the 
same as the rank order that would be obtained with any other series 
of questions in the same area. 

These properties are especially important in the case of a single 
opinion question analysis.? An opinion question selected from a scale 
will be known to divide people into the same relative rank order 
as any other question in that area. Knowing that a respondent 
“favored” a statement which few people ‘favored’ enables one to 
know that he would also favor all other statements in the same area 
“favored” by a larger number of people. 

Thus, one important advantage of a scale lies in the rationale it 
offers for the definition of a single continuum as discussed in detail 
in Chapter 3. The concept of a single continuum, while widely 
used in social research, lacks a clear-cut definition. Our definition 
of a single continuum as a series of items each of which is a simple 
function of the scale scores permits a clear-cut statement of what is 
meant by a rank order based on a single variable. The properties 
described above are a result of working with unidimensional uni- 
verses thus defined. While it is possible to get a rank order in non- 
scalable areas by assigning arbitrary weights to the items, this rank 
order does not possess the important invariant properties of a rank 
order where the problem of how much to weight each item does not 
come up. 

2. Problems in interpretation. Knowing that questions of similar 
content scale together shows them to be measuring a single attitude 
or opinion variable and warns one against attempting to interpret 
them as measuring different (albeit related) variables on the basis 
of their manifest content. This warning is particularly appropriate 


* For example, Cantril concludes in a chapter on “The Use of Breakdowns”: ‘For 
most members of the population, any refined attitude scale or rank-ordering is much 
too complicated to use in the usual interviewing situation. ... If several relatively 
simple questions can be devised to tap the same variable, then the answers of a single 
individual on these several questions can be pooled to place him on the scale with re- 
spect to the total population.” Scale analysis satisfies perfectly the condition of the 
latter sentence—several relatively simple questions can be used and tested for scal- 
ability. 

Hadley Cantril (editor), Gauging Public Opinion (Princeton University Press, Prince- 
ton, N.J., 1944), p. 191. 


UTILITY OF SCALOGRAM ANALYSIS 166 


when one is tempted to infer some “‘causal’’ connection from a cross 
tabulation. The public opinion literature is full of such “‘causal’’ 
explanations. Two opinion questions are cross tabulated to show, 
for example, that individuals who do not want OPA continued are 
among those more likely to object to government subsidies to keep 
pricesdown. A hypothetical cross tabulation might look as follows: 


Question: Do you approve or disapprove 
Question: Do you approve of the continuation of OPA? 
or disapprove of the govern- 


ment paying money to farm- 

ers in order to keep prices 

down? Approve Disapprove Total 
Approve 50 0 50 
Disapprove 30 20 50 
Total 80 20 100 


An inference that might be made from this table would be that 
approval of a policy of subsidies “‘leads’”’ one to approve also of the 
OPA. Theoretically one can never infer “cause” from the cross 
tabulation of two attitudes or opinions gathered at the same time.‘ 
However, opinion analysts make this inference quite often. Doubt- 
ful as such an inference is at all times, it is even more doubtful when 
the scalogram pattern showing a zero frequency in the positive- 
negative cell is obtained. Such a pattern strongly indicates the 
presence of but a single variable, i.e., attitude toward governmental 
control, with the possible interpretation that both questions are 
simply different aspects of the same one thing. The interpretation 
of such a pattern can best be made in terms of rankings on a single 
opinion continuum, rather than in terms of a relationship between 
two different variables. 

3. Problems in prediction. If an area is scalable, then it can be 
used very simply as either the criterion or predictor in a prediction 
problem. 

If the scale items are to be predicted from outside variables, the 
researcher knows that any result based upon the cross tabulation of 
one of the scale questions with some other variable (e.g., a back- 
ground characteristic such as age or sex) will be essentially the same 
as the result obtained from any of the other questions in the series. 
It is therefore often unnecessary in making ‘breakdowns’ to tabu- 

* A good discussion of this problem is given in E. Greenwood, Experimental Sociology 


(King’s Crown Press, New York, 1945). 
5 See Chapter 3 for a complete description of this pattern. 
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late more than one of the questions in the series. For example, if 
a series of questions is known to scale, then all questions in the scale 
will show men more “favorable” than women, if such is the case. 
Of course, the most sensitive differentiation will be obtained by use 
of the scale scores themselves. The single scale question which will 
show the highest relationship will be that question whose marginal 
frequency most closely approximates the marginal frequency of the 
outside variable. 

If an outside variable is to be predicted from the scale items, the 
multiple correlation on the scalable area is equivalent to the zero- 
order correlation with the scale score. Only the simple correlation 
with the scale scores need be computed, yet the full predictive power 
of the area is realized thereby. The multiple correlation must al- 
ways be equal to the simple correlation of the outside variable with 
the scale ordering of persons. 

The importance of this property for the problem of prediction 
weights is obvious. In a scalable series of items, the same predic- 
tion weights (namely, the scale weights) may be given to the items 
regardless of what the outside variable may be. This is not the 
case for a nonscalable area. If a series of items do not scale, then a 
new set of weights must be computed for each new prediction prob- 
lem. In this way scale scores provide a quantification of the attri- 
butes or responses that is invariably most efficient for predicting 
any outside variable whatsoever. 

Furthermore, it is known that adding more items or questions 
from the same area will not increase the multiple correlation. The 
maximum predictability in the area can be determined simply by 
means of the selected sample of items or questions. Since much of 
public opinion analysis implicitly involves predictions on the basis 
of only one or two questions in an area, this feature of scale analysis 
is of utmost importance. One can assign simple weights to a small 
number of questions from the scalable area with the assurance that 
the prediction result would be the same as that based upon all the 
possible questions from the same area. Adding more questions to 
a sample from a scalable universe will not increase predictive power 
substantially, so that relatively few items are highly efficient in 
practice for retaining all the predictive power the universe has for 
the criterion. 


Scale Analysis and the Measurement of Intensity 


Finally, scale analysis provides the basis for the further study of 
a component present in scalable areas—intensity of feeling. With 
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the rank order of individuals along a scale continuum from more to 
less “favorable,” there is implied a related rank order of individuals 
along an intensity continuum from more to less “intense.’”?’ The 
measurement of this intensity component and its application to atti- 
tude and opinion analysis will be discussed in detail in Chapter 7. 

To summarize the utility of scalogram analysis, we find that this 
utility springs on the one hand from the use of the scalogram anal- 
ysis for the detection of scales, and on the other hand from the 
use of scales in the analysis of one’s data. In relation to the de- 
tection of scales, scalogram analysis helps to define one’s prob- 
lems in terms of smaller subareas or in terms of typological analyses. 
Scalogram analysis also aids in the construction of items by en- 
abling one to test these items for meaning or “‘bias’’ and by indi- 
cating how response categories can be combined. In relation to 
the advantages of working with scales, we find that problems of 
definition, interpretation, and prediction are greatly simplified if 
the items can be shown to come from a scalable universe. 


SECTION III 


FURTHER RAMIFICATIONS OF SCALH ANALYSIS 


Work on the present theory of scale analysis has been limited 
almost entirely to the field of attitudes and opinions. While scale 
analysis is formal and applies to any type of social science data, the 
empirical applicability of scale analysis to these other groups of 
data remains to be investigated. It may be helpful for future re- 
search to report some of the experiences which have been encoun- 
tered in studying scales in the field of attitude and opinion research. 


The Occurrence of Scales 


A question which is often asked is, “How often do you find scales 
in practice?” Quite obviously, if the rigid parallelogram pattern 
required of a scale did not occur empirically, then the theory would 
have very little practicality. There is a real question, then, as to 
whether scales occur frequently enough to be applicable to the 
study of social attitudes. 

As has been indicated earlier, [the bulk of social phenomena is too 
complex for one to expect many aspects to be scalable. | It would 


* Since this was written there has appeared the interesting example of scalability of 
institutionalized discrimination. See Gilbert Shapiro, ‘“Myrdal’s Definitions of the 
‘South’: A Methodological Note,’ American Sociological Review, Vol. 13, No. 5 (October 
1948), pp. 619-621. 
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be interesting if we could, on the basis of the experience of the 
Research Branch, estimate to what extent and in what kinds of data 
scalability occurs. Unfortunately such an estimate is not war- 
ranted at the present time. The work of the Research Branch was 
of such a nature due to the immediate problems of wartime research 
that, in many cases, where a series of items proved to be nonscalable, 
no further attempt was made to see if the conceptual universe could 
be redefined into subuniverses which might prove to be scalable. 
The thorough application of the present theory of scale analysis 
would call for many pretests and revisions of both the content uni- 
verse and the sample of items. - For example, if a small proportion 
of items from a sample does not conform to the scale pattern that 
seems to be indicated by the rest, the discrepancies would have to 
be studied further, perhaps by detailed interviews, to see what the 
disturbances might be due to. In some cases, it may be found that 
a subuniverse is indicated that must be treated apart from the rest 
of the universe; in other cases it may be found that there have been 
mistakes in wording or presentation of the questions. : Thus fmany 
cases which were classified as nonscalable upon first examination 
may have proved, upon additional research, to be eesiable or to be 
divisible into scalable subuniverses.] 

However, while it was a much more frequent. experience not to 
find a series of items scalable according to the four criteria set forth, 
there were certainly enough instances of scalable areas to .warrant 
further research with the present theory of scale analysis..: It might 
be helpful for future research to list some of the areas which upon 
first approximation did not prove scalable. Several of these non- 
scalable areas were attitude toward “sweating outa jump” (pre- 
sented in the previous section), attitude toward the Russians, atti- 
tude toward internationalism (postwar foreign policy), attitude 
toward obeying Army rules and regulations, attitude toward the 
Army Score Card Plan (later broken down into two scalable areas, 
attitude toward the idea of the Score Card Plan and attitude toward 
the administration of the Score Card Plan—see Chapter 7 for de- 
tails), attitude toward future results of the war, attitude toward 
mail censorship, attitude toward veteran preference for government 
jobs, attitude toward civilian support of the war effort, attitude to- 
ward the Army’s concern with personal welfare, attitude toward use 
of athletes and: entertainers in the Army (later broken down into 
two scalable dreas—attitude toward athletes and attitude toward 
entertainers). Examples of several scalable areas were given in 
detail in the previous chapter. 
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A promising avenue for future research might consist of a classifi- 
cation and examination of those areas in social science which prove 
scalable or nonscalable. It would be particularly helpful to have 
some systematic analysis of how certain areas, and even individual 
items which proved nonscalable upon first approximations, were 
later modified to form scalable areas or items. The present data 
must be taken almost entirely as illustrative of the present theory of 
scale analysis—especially in relation to the intensity component to 
be presented in Chapter 7. A great deal of work has yet to be done 
upon the empirical occurrence of scales. 

It is quite obvious from the results obtained by the Research 
Branch that many attitudinal areas are so complex as to be non- 
scalable. The possible existence of other systems of order than 
that of the perfect scale is a problem which must be studied further. 
Another type of structure which permits of a rank ordering of the 
people—albeit with a different meaning from that of the perfect 
scale—is that called a quasi scale, as is discussed next. 

Quasi Scales 

[it may happen that a series of items appears to measure a strong 
common variable, but that there are too many errors of reproduci- 
bility rmit adequate reproducibility of responses from scale 
scores. e errors of reproducibility may be caused either (a) by 
one or two other important variables that may be in the area, or 
(b) by many small variables. \ The existence of one or two addi- 
tional variables in the area is indicated by nonscale types in the scale 
pattern. If such definite nonscale types exist, then the multiple 
correlation of an outside variable with the whole area would not be 
quite equivalent to the simple correlation with rank order on the 
dominant variable, and would be attained only by taking the non- 
scale types into account. On the other hand)if errors of reproduci- 
bility are random, then the multiple correlation of any outside vari- 
able on the area will be precisely equal to the simple correlation with 
the rank order on the area. This property, it is important to note, 
holds no matter how low the reproducibility <s. \ 

Fsomne areas which are not scalable in terms of reproducibility are 
called quasi scales; their reproducibility may not be high but their 
errors occur in a sort of gradient.\ This gradient pattern of errors 
indicates that, while there is not a single factor operating as in the 
case of a scale, nevertheless there is a single dominant factor and 
indefinitely many small random factors, so that prediction of_any 
external variable must rest essentially on the dominant factor. [The 
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dominant factor is measured by the quasi-scale scores. | This means 
that although quasi scales lack an essential property of a scale-rank 
order, i.e., they cannot reproduce the respondent’s characteristics 
on the items in the area very well—nevertheless, the rank order is 
perfectly efficient for relating any outside variable to the area. | 
Therefore, if examination of the errors of reproducibility shows 
them to conform to a certam gradient pattern, and not to be grouped 
together to form nonscale types, then we have what may be called a 
quasi scale. The pattern of responses indicates that there is a single 
dominant factor measured by the quasi-scale scores. 

The diagram on the next page will serve to illustrate the difference 
between random scale errors, grouped nonscale errors, and gradient 
quasi-scale errors. 

The error pattern of the “‘true’’ scale question is recognizable from 
the random nature of the few errors of reproducibility that do occur. 
The cutting point between “‘positive’ and “negative” categories 
would fall between respondent 18 and 19. Predicting all respond- 
ents from 1 to 18 to have given “‘positive’’ answers would result in 
two errors (respondents number 11 and 16); predicting all respond- 
ents from 19 to 50 to have given “negative” answers would result 
in three errors (respondents number 22, 25, and 31). Reproduci- 
bility of this item would therefore be high—5 errors out of 50 pre- 
dictions. 

The error pattern of the nonscale question is recognizable from 
the way in which the errors are grouped together. Again locating 
the cutting point between “‘positive’’ and “‘negative” categories be- 
tween respondents 18 and 19, we find that predicting all respondents 
from 1 to 18 to have “positive” answers would result in five errors, 
and furthermore that these five errors were grouped together among 
respondents 8, 9, 10, 11, and 12. Similarly predicting respondents 
19 to 50 to have given ‘“‘negative”’ answers, also results in a grouping 
of six errors among respondents 39, 40, 41, 42, 43, and 44. Repro- 
ducibility of this item would therefore be low—11 errors out of 50 
predictions—and, more important, these errors would be grouped 
together to form nonscale types. The grouping of errors into non- 
scale types indicates that more than one strong variable is present. 
We cannot find any single rank order of respondents that would 
successfully represent the attitudes of respondents to both variables. 

The error pattern of the quasi-scale question is recognizable from 
the manner in which the fairly large number of errors that occur 
gradually decrease in number as one moves further and further 
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away from the cutting point. These errors are much more frequent 
than “true” scale errors, but do not group together like nonscale 
errors. Again locating the cutting point between respondents 18 


RANDOM SCALB GROUPED NONSCALD GRADIENT QUASI- 
ERRORS SRRORS SCALE MERORS 
Respondent Categories Categories Categories 
Rank Order + - + - Se es 
1 x x x 
2 ».¢ x x 
3 ».¢ x x 
4 x x x 
5 x x x 
6 x x x 
7 x x x 
8 ».¢ x x 
9 x x x 
10 x x x 
11 x x x 
12 x x x 
13 x x x 
14 x x x 
15 xX x 
16 x x x 
17 x x x 
18 x x x 
19 x x x 
20 x x x 
21 x x x 
22 x x x 
23 x x x 
24 x x x 
25 x x x 
26 x x x 
27 x x x 
28 x x x 
29 x x x 
30 x x x 
31 x x x 
32 x x x 
33 ».¢ x x 
34 x x xX 
35 x x xX 
36 x x x 
37 x x x 
38 x x x 
39 x x x 
40 x x x 
41 x x x 
42 x x x 
43 x x x 
44 x x x 
45 x x x 
46 x ».¢ x 
47 x x x 
48 x xX x 
49 x x x 
50 x x x 
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and 19, we find that predicting ‘‘positive” responses for respondents 
1 to 18 would lead to six errors, and that these errors would occur in 
groups of three (respondents 13, 14, and 15), then two (respondents 
8 and 9) and finally one (respondent 5) in a sort of gradient as one 
moved away from the cutting point. A similar gradient of errors 
would occur in the “negative” response predictions for respondents 
19 to 50; three errors among respondents 30, 31, and 32, two errors 
among respondents 37 and 38, one error for respondents 42 and 48 
respectively. Reproducibility of this item would be low—13 errors 
out of 50 predictions—but more important, these errors would not 
be grouped together into nonscale types, but would consist of small 
groups of errors gradually decreasing in size as one moved away 
from the cutting point. 

The Research Branch has found evidence of the quasi-scale pat- 
tern in the case of personality inventories, information tests, and 
measures of intensity of feeling. An example of such a quasi scale 
is given by the scalogram on page 164 based upon an aggregate of 
fifteen items symptomatic of psychoneurotic tendencies (Scalogram 
10). Most items have a large number of errors, although there is 
enough nonerror to permit the appearance of a fairly definite paral- 
lelogram pattern. The coefficient of reproducibility is .73 (this 
could be raised somewhat by dichotomizing the response categories). 
In enerallthe errors for each category conform to a gradient, errors 
decreasing gradually as the ranks depart from the region of highest 
density of response for the tea) 

The importance of a quasi scale lies in how it is used for external 
prediction problems. While we cannot derive a person’s responses 
from his quasi-scale score,| the score does yield a zero-order corre- 
lation with any outside variable which is equivalent to the multiple 
correlation on all the items in the quasi scale. [The prediction of the 
external variable rests essentially on the dominant factor that is 
being measured by the quasi-scale scores. | Thus a quasi scale has 
the full mathematical advantages of a scalable area. 

the case of a quasi scale, it is ordinarily necessary to use a 
much larger sample of items than in a scale in order to obtain the 
proper rank order of individuals\, 

What implications does the quasi scale hold for scale measure- 
ment? Quasi scales seem to enable one to take a large number of 
items which have some strong common content and to derive from 
their intercorrelations a score which permits a rank order inde- 
pendent of item weights. It would thus appear that quasi scales 
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offer a promising avenue of research into some complex areas which 
are neither scalable nor divisible into scalable subareas. While 
the single dominant variable of a quasi scale cannot be represented 
by means of a small number of items due to the amount of error in- 
volved, increasing the number of items which contain this dominant 
variable makes this error assume a gradient pattern, and permits 
an invariant rank order. 

One interesting way in which quasi scales can be seen to arise is 
as follows. Suppose an attitude universe can be broken down into 
many subuniverses, each of which is scalable. Suppose further, 
that when the scores on the subuniverses (which can now be treated 
as quantitative variables) are intercorrelated, they are found to 
correspond to Spearman’s single common factor structure. Then 
it must follow that if an item is selected from each of the subuni- 
verses, and a scalogram formed of these items, the scalogram will 
reveal a quasi scale; and furthermore, the quasi-scale score will be 
essentially the score on the Spearman common factor. 

The Research Branch did have a nonscalable universe—morale 
—for which it found several subuniverses to be scalable, such as 
pride in one’s outfit, satisfaction with one’s job, confidence in one’s 
leaders, etc. However, the intercorrelations of scale scores were 
found not to conform to the Spearman structure, but rather to 
indicate more than one common factor. This explains why morale 
was not found to be even a quasi scale, and shows that no single 
ordering of men is possible for morale as a whole. 


Sampling the Universe of Items 


L4 basic assumption of the proposed theory of scale analysis is 
that a sample of items which were found to be scalable can ade- 
quately represent the unlimited universe of similar items. An im- 
portant property of a scalable universe is that the ordering of per- 
sons based on a sample of items will be essentially that based on the 
universe. Theoretically, then, if a series of ten items were found to 
be scalable, then all items of the same content would also be scal- 
able. 

[The process whereby the initial ten items are constructed is an 
intuitive one.) One turns over in one’s mind many possible items 
which characterize the universe in which one is interested. Ten of 
these are selected as a sample of all possible items that might occur 
to the research worker, and these ten are then subjected to a scale 
analysis. these ten items scale, the assumption is made that all 
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Scalogram No. 10. Psychoneurotic Inventory 
Questions and Answer Categories 


. Do you have any particular physical or health problem? 


6 Yes 
7 No 
8 Undecided 


Have you ever been bothered by shortness of bre: th when you were not exercising 
or working hard? 


6 Yea, often 
7 _____ Yes, sometimes 
8 No, never 
. Are you ever troubled by your hands sweating so that they feel damp and clammy? 
6 Yes, often 
7 Yes, sometimes 
8 No, never 


Are you ever bothered by having nightmares (dreams that frighten or upset you 
very much)? 


6 Yes, many times 
7 Yes, a few times 
8 No, never 

. How often are you bothered by having an upset stomach? 
6 Nearly all the time 
7 Pretty often 
8 Not very often 
9 Never 

Have you ever been troubled by ‘‘cold sweats’? 

6 Yes, often 
7 Yes, a few times 
8 No, never 


. Have you ever had any fainting spells? 


6 Yes, several times 
7 Yes, a few times 
8 Never had any 
Have you ever been bothered by your heart beating hard? 

6 Yea, often 
7 Yes, a few times 
8 No, never 

. Have you ever been bothered by pressure or pains in the head? 
6 Yes, often 
7 —__ Yes, sometimes 
8 No, never 


Have you ever had spells of dizziness? 


6 Yes, many times 
7 Yes, a few times 
8 No, never 
Do you often have trouble in getting to sleep or staying asleep? 
6 Very often 
7 Sometimes 
8 Almost never 


. 
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49. Do your hands ever tremble enough to bother you? 


6 Yes, often 
7 Yes, sometimes 
8 No, never 
50. Do you ever bite your fingernails now? 
6 Yes, often 
7 ____. Yea, sometimes 
9 No, never 
51. Are you ever troubled by sick headaches? 
6 Yea, often 
7 —_—— Yes, sometimes 
8 No, never 
52, Are you ever bothered by nervousness? 
6 Yes, often 
7 ——— Yea, sometimes 
8 No, never 


the items that come to one’s mind on the same topic would also 
scale.\\If these ten items do not scale, by the same token the as- 
sumption is made that all the items also would not scale. \ This is 
an important assumption, since upon its truthfulness rests the whole 
concept of inferences about the universe based upon a sample from 
that universe. How can such an assumption be subjected to em- 


irical verification? 

"Obviously no fixed universe exists from which_gne could select 
random samples and test their representativeness. | However, the 
following suggestion is made for the construction of an ‘‘opera- 
tional” universe which would serve as an important test of the as- 
sumption of sampling representativeness. A universe is named 
and then the research worker compiles a series of perhaps 300 items 
which characterizes that universe. These 300 items then can be 
thought of as an “operational” universe. The assumption of sam- 
pling could then be tested by selecting at random subseries of 10 
items each and testing these for scalability. How many of these 
subseries scale or do not scale? This result could be compared with 
the scalability of the entire aggregate of 300 items. In this way, 
the assumption of representativeness could be tested. The as- 
sumption of the invariance of the rank order of scale types could 
also be tested by means of the correlations between subseries and 
the total series for scalable areas. See Chapter 8 for a more de- 
tailed discussion of this problem, and also for numerical examples of 
comparisons between subsamples of items. 
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The Determination of Characterizing Items 


The above test could also be expanded to include an experiment 
in the determination of characterizing items. | At present the defi- 
nition of content is entirely subjective. There is no way to decide 
whether or not an item belongs to the universe, except through a 
decision on the part of the investigator or a group of judges. 
Thus it is possible for one investigator studying an area, such as 
attitude toward war, to construct a series of items, to test these 
items for scalability, and to find that they are nonscalable, whereas 
another investigator studying the same area by means of a dif- 
ferent series of items might find them scalable. The answer of 
course would have to be that since the content of the items them- 
selves defines the area, these two investigators were studying dif- 
ferent areas despite the fact that the areas bear the same con- 
ceptual title. 

The unsatisfactory nature of this answer is obvious. [ Some less 
subjective method for defining the characterizing item would help 
in the selection of sample series) Additional research is greatly 
needed on this problem. One possible approach might include the 
use of a group of judges to select the characterizing items whose 
scalability is to be tested. A combination of the Thurstone tech- 
nique of evaluating the content of items and the present technique 
of scale analysis might prove very instructive.’ While it is prob- 
ably impossible to remove the element of subjectivity from the 
process of constructing characterizing items, it should be possible 
to decrease the danger of individual bias by means of groups of 
judges. 


The Halo Effect 


An interesting ramification of scale analysis which deserves fur- 
ther study is the relation between scale analysis andithe psycho- 
logical phenomenon observed in the field of rating ‘‘scales’”’ called 
the “halo” effect.| Enlisted men who are asked to evaluate their 
officers on a series of eleven questions dealing with apparently dif- 
ferent aspects of leadership appear to “carry-over” a generally 
“favorable” or “unfavorable” attitude, instead of answering each 


7 An example of this approach is given by A. L. Edwards and F. P. Kilpatrick, “A 
Technique for the Construction of Attitude Scales,” Journal of Applied Psychology, 
Vol. 32, No. 4 (August 1948), pp. 374-384. 
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question as though it dealt with a different aspect of the officer’s 
behavior (see Scalogram 2 in the first section). | It has often been 
found that questions asking for the evaluation of different groups 
of people on different traits, including rating schemes for single 
individuals, are usually so closely related that the researcher is 
forced to conclude that his subjects are rating some general trait 
rather than the specific traits_inherent in the manifest content of 
the items in his rating scheme.® | This phenomenon has often been 
called a “halo” effect. How does this “halo” effect relate to the 
present theory of scale analysis? 

(Scale analysis appears to afford a method for studying “halo” 
effect in the form of a test of the hypothesis that there is some single 
variable upon which the different evaluations are based.| It carries 
the “halo” concept into the entire field of attitude or opinion re- 
search and shows how only a single variable may underlie the re- 
sponses to what the analyst believes to be a series of opinion ques- 
tions on quite different aspects of a problem. And just as the 
interpretation of separate trait names must be treated with caution 
once the existence of “‘halo’”’ has been found, so it would seem the 
individual interpretation of separate opinion questions must be 
treated with caution once they have been found to belong to a 
single scalable universe. ‘‘Halo’’ in this sense may be studied by 
a test for single meaning. Scale analysis thus may be able to pro- 
vide an interesting rationale of how individuals can be ranked upon 
a single rating continuum based upon a sample of items. 


The “Relativity” of Scales 


Another important ramification of scale analysis concerns the ap- 
plication of scales to different populations or at different periods in 
time. (Scales are relative both to time and to populations,) A series 
of items or questions may form a scale for one population and yet 
fail to form a scale for another population. Or a series of questions 
may fail to form a scale for a total population, and yet scale for a 
subgroup of that population. Oty would therefore be possible that a 
nonscalable series of items could be found to form a scale when 
limited to a subgroup of the population. \ If such were the case, one 
could make an analysis of the structure of a content area in terms 

* Ag stated by Thorndike, “Ratings were apparently affected by a marked tendency 
to think of the person in general as rather good or rather inferior and to color the judg- 
ments of his qualities by this general feeling.’ Edward L. Thorndike, ‘Constant 


Error in Psychological Ratings,” Journal of Applied Paychology, Vol. 4, No. 1 (March 
1920), p. 25. 
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of the different meanings involved for different subpopulations. A 
useful technique for the determination of ‘deviant’ subgroups 
would be to sort out those individuals who were responsible for 
scale errors and to study their composition to see if they came from 
any particular subgroup. Scales based upon one population can- 
not be transferred to the study of some other population without 
first testing to see if the items still scale for the new population. 
There may be a great fallacy in many cases of applying scales which 
have been “standardized” on one population to the study of new 
populations. 

One suggested hypothesis for the relativity of scales is that the 
difference between the two populations is not one of degree, but of 
kind. One group is not simply more or less favorable than the 
other group, but actually defines the issue in a completely different 
way. | The items do not characterize the same universe to the dif- 
f populations. 

When this is the case, when a series of questions scales for one 
population but not for another, then comparisons among the popu- 
lations involving statements that one group is “more” or “less” 
favorable on the e universe than another group must be made 
with great caution.’ | This is an additional reason for studying atti- 
tudes and opinions by means of a series of questions rather than 
only a single question. In comparisons between groups based upon 
a single question, one cannot be certain that the different groups at- 
tach the same interpretation to the question asked. One cannot 
predict on the basis of a single question analysis that, if another 
question were asked on what was conceived to be the same issue, 
the rank order of “favorableness’’ among different groups might 
not change. For example, even if a single question were used only 
to compare different educational levels on degree of ‘‘favorableness”’ 
toward an issue, it would first have to be shown that the question 
had the same meaning, i.e., belonged to the same scalable universe 
for the different educational groupings, before any comparisons of 
degree of favorableness could be made. 

The above discussion of the relativity of scales according to the 
population studied applies equally well to the relativity of scales for 

® The test for scalability requires that the same question be asked of the different 
populations and that an attempt be made to combine answer categories in the same 
way, even though these combinations produce slightly more error than they would if 
each population were scaled separately. If a question scales for two populations, but 
with different combinations of answer categories, the conclusion would be that the quea- 


tion means the same thing to both populations, but that the answer categories do not 
represent the same degree of “favorableness.” 
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the same population when two different time periods are studied. 
A series of items may form a scale at one time for a given popula- 
tion and yet fail to form a scale at another time. Again, the sug- 
gested explanation is that the change in the population’s attitude 
or opinion with time is one of kind, rather than degree. People have 
not simply become “more” or “‘less” favorable; they have changed 
their definition of the issue. To measure a change in degree of 
“favorableness,”’ it would appear to be ne that the attitude 
remain in the same continuum both times. is hypothesis, if cor- 
rect, indicates a basic weakness in much of the present trend anal- 
ysis based upon repeating a single question) Unless it is known 
that the meaning of this question has not changed with time, the 
statement that a group has become “more” or “less’’ favorable is 
a doubtful interpretation. [Scale analysis affords a way to test 
whether or not the content of the questign has retained its original 
meaning. ) 

An especially important example of this problem in trend anal- 
ysis applies to experimental studies of induced change, i.e., the 
effectiveness of propaganda by means of before and after group 
tests. Unless the same scale is obtained before and after exposure, 
one should speak of changes in ‘“‘degree” of favorableness with great 
caution. This is especially evident when, for example, an item 
such as ‘‘All Russians are Communists,” which before the exposure 
scaled with other attitude-toward-Russia items, shows a great change 
but no longer scales. One hypothesis could be that this item has 
changed froin an attitude universe to an information universe, due 
perhaps to the stress of the propaganda exposure on the facts of the 
case. A test of this hypothesis might be to include other infor- 
mational items, before and after, and see if those items which ap- 
pear to shift from attitudinal to informational, no longer scale with 
the attitudinal items but do scale with the informational items. 


* * * * * 


The ramifications of scale analysis discussed above constitute a 
bare minimum of all the problems of scale analysis which remain to 
be fully investigated. The present research represents only an ini- 
tial attempt to formulate the foundations for the present theory of 
scale analysis and to develop a workable technique for the determi- 
nation of scales by means of this theory. As has been stated previ- 
ously, the present research is based upon data gathered in the course 
of the war and is limited almost entirely to the attitudes of soldiers. 
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The further testing of the present theory with other types of social 
data is an important avenue for future research. It can be ex- 
pected that future research will serve to show more clearly than is 
presently possible both the disadvantages and advantages of the 
present theory and method. 


CHAPTER 6 


RELATION OF SCALOGRAM ANALYSIS 
TO OTHER TECHNIQUES’ 


ARIOUS approaches to the study of attitudes and opinions have 

been made in the past. These approaches differ occasionally 
in objectives. In order to obtain a further perspective on the pur- 
pose and techniques of scalogram analysis, it may be helpful to com- 
pare this approach with previous ones. Emphasis will be placed 
primarily on similarities or differences in purpose and usefulness. 
We begin with a discussion of the distinction between scale and 
nonscale approaches to the study of problems. We then compare 
scale analysis with several other means of analyzing universes of 
data, e.g., the use of arbitrary indexes, item analysis, equal-appear- 
ing intervals, paired comparisons and factor analysis. 


Scale and Nonscale Approaches 


In order to understand better what types of problems are best 
studied by means of scales, it may be helpful to have a clear idea as 
to what types are probably not profitable for scale analysis. In the 
analysis of qualitative data, a vast number of problems definitely 
do not involve the notion of a scale. A great many nonattitudinal 
problems involving qualitative variables, like sex, religion, etc., usu- 
ally deal with each attribute as a complete variable in itself, and not 
as representing a universe of similar items. Such cases are not 
scale problems. On the other hand, many problems in social psy- 
chology regard a qualitative item, like an opinion or attitude state- 
ment, as but a sample from a universe of similar items. Here is 
where a scale analysis is helpful. It tells whether or not the uni- 
verse is scalable by using only a small sample of the items. Social 
phenomena are usually complex. However, if a scale is found to 
exist for a universe of phenomena, that means that a certain sim- 
plicity attends those phenomena. The theory of scales tells how 


1 By Louis Guttman. 
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to recognize and take advantage of that simplicity. If the phe- 
nomena do not follow a simple pattern, then scale analysis shows 
that a more complicated technique is needed to handle the data 
properly. 

A problem involving a universe of data, therefore, should first be 
subjected to a scale analysis. The analysis will show either that 
scales are involved or are not involved. A problem may be thought 
& priori to involve a scalable universe, but analysis of the data may 
show that a scale does not exist. The existence or nonexistence of 
a scale is not a criterion of the worth of a problem. If a problem 
turns out to involve nonscalable data, they should be treated, not 
as scales, but in whatever manner will yield a proper answer. 

To summarize briefly, problems which do not involve samples 
from a universe of items are not in general scale problems. Prob- 
lems which do involve such sampling—including almost all of atti- 
tude and public opinion work—will profitably be studied first by 
means of a scale analysis. If the universe is scalable, or can be 
broken down into scalable subuniverses, then it can be handled very 
easily by simple scale scores. How best to handle nonscalable uni- 
verses remains a far more complicated, and as yet unsolved, prob- 
lem. 

The distinction between scale and nonscale data may perhaps be- 
come even more clear when we consider the different roles that a 
factor or variable can play in a given research project. Research 
usually has either or both of two purposes: description and predic- 
tion. Let us start with prediction. 

Two roles for variables in prediction studies. Each variable in a 
prediction study plays one of two possible roles. It can serve either 
as something to be predicted—in which case it is often called techni- 
cally a criterion—or it can serve as a predictor. There are many 
possible ways in which each of these two roles can be assumed, such 
as in multiple and in partial correlation, but this should not be 
allowed to obscure the fundamental nature of the roles. 

Each of these two roles can be filled by qualitative variables (at- 
tributes) or by quantitative variables. Conventional statistical 
textbooks concern themselves largely with the prediction of quanti- 
tative variables. If the predictors are also quantitative, then 
linear or curvilinear regression equations are derived for testing 
predictability. If the predictors are qualitative, then the approach 
of analysis of variance is used. 

Relatively little attention has been given in the past to the case 
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where qualitative variables are the thing to be predicted; and very 
little attention has been given to the notion of a universe of qualita- 
tive variables, either as a criterion or as a predictor. Qualitative 
criteria need techniques different from those of quantitative vari- 
ables. Least squares and analysis of variance do not apply. Fur- 
thermore, the study of universes of variables requires a special ap- 
proach. Scale analysis studies a universe of qualitative data in a 
method appropriate to such data. It proves that, if the universe 
is scalable, it can be handled very easily in either role, whether as 
criterion or as predictor, by a simple scoring scheme. On the other 
hand, if the universe is not a scale (nor a quasi scale), then use of an 
arbitrary scoring scheme will not in general permit the universe to 
fill either role properly. 

The problem for analysis, then, is to ascertain the complete multi- 
variate distribution of an indefinitely large universe of attributes. 
This means finding out how each respondent stands on each of the 
possible items. Here is where scale analysis comes in. If the uni- 
verse of items is a scale, then each respondent’s pattern of responses 
can be represented by his scale rank among all respondents. From 
the point of view of description, an important feature of a scale is 
that a man with a higher scale score than another will be just as 
high or higher on every item in the universe (within scale error). 
He is known (within scale error) to be in the one answer category of 
each question that covers his scale rank. From the point of view 
of prediction as well as description, a crucial feature is that the dis- 
tribution of scale ranks based on only a sample of items is known to 
represent the complete multivariate distribution of the universe of 
items. A relatively few items can be used from a scale to represent 
the universe accurately either as a criterion or as a predictor. 


The Use of Arbitrary Indexes 


Although techniques for scale analysis are very simple, the ques- 
tion may be raised as to whether or not they can be dispensed with 
entirely. Ifa set of items is agreed to have a homogeneous content, 
to belong to the same universe, of what advantage is scale analysis? 
Why bother with determining the reproducibility of the area? Why 
not simply assign arbitrary weights to the responses to each ques- 
tion, add up these weights to obtain a total score which can be called 
an indez of the area in question? Cannot such an index be adequate 
for relating this attitude area to other behavior? This is often the 
solution reached by public opinion analysts who ask a “battery” or 
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series of opinion questions on what appears to be the same issue and 
then proceed to count up the number of “favorable” replies.? 

It is true that if an area is scalable, then the resulting scale scores 
will correlate very highly with any index obtained by arbitrary 
weights, provided the weights are in the right direction. Scale 
theory proves that there is no harm in obtaining an apparently 
arbitrary index from a scalable area, either for descriptive or pre- 
dictive purposes. That is why only very simple weights and scores 
are used in scale analysis in practice. But if the area is not a scale 
(or quasi scale), then we have an entirely different story. 

If an area is not a scale, then it may often be possible to break it 
down into subareas of content that are scalable separately. In 
such a case, each person should receive several scale scores, one on 
each subarea. The entire area can then be properly described by 
use of these several sets of scores. These scores can serve as the 
criteria if it is desired to predict the area from other variables; it is 
the profile of scale scores that is to be predicted in such a case. 
Similarly, if it is desired to use the items in the area as predictors of 
an outside variable, it is necessary only to determine the multiple 
correlation of that outside variable on the scale scores (which is 
very little work compared to computing directly the multiple corre- 
lation on all the items themselves). 

Omitting a scale analysis and just going ahead with a single arbi- 
trary index can completely obfuscate the purpose of the research, 
whether for descriptive or predictive purposes, if in reality several 
scores are required and not just one. There are at least five basic 
defects to using an arbitrary index if the area is not actually a scale 
(or quasi scale). These are: (1) Lack of descriptive meaning. 
(2) No criterion for weights. (3) Sampling of items and descrip- 
tion. (4) Improper weights for prediction. (5) Sampling of items 
and prediction. Each of these will now be discussed in turn. 

1. Lack of descriptive meaning. A first defect concerns the de- 
scriptive meaning of the index. In a scale, each score has a definite 
meaning with respect to the actual items. From a person’s score 
can be reproduced his response to each and every question (within 
scale error). As a consequence, it is meaningful to say that a per- 
son with a higher score than another has a more favorable attitude 
than the other person; this is true because a higher score means an 


* See, for example, F. Mosteller and H. Cantril, ‘The Use and Value of a Battery of 
Questions,”’ in H. Cantril (editor), Gauging Public Opinion (Princeton University Press, 
Princeton, N.J., 1944), pp. 67-73. 
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equivalent or higher response to each and every item (within scale 
error). This simple and direct meaning for scores does not at all 
obtain for nonscalable areas. No single index can reproduce a per- 
son’s responses to the items in such a case, so that no single index 
can be said to be an adequate representation of the area. Two per- 
sons can have the same score from quite different behavior. Fur- 
thermore, the fact that one person has a higher index score than an- 
other person has no particular meaning with respect to the content 
involved; the first person may be more favorable on some items but 
less favorable on other items than the second person. There is no 
clear meaning as to what a “‘higher’’ attitude is, nor even as to what 
“equality” on the attitude signifies. 

For example, one person might say, “I think Negroes should have 
a right to vote, but I don’t think they should be given skilled jobs.” 
Another might say, ‘I don’t think Negroes should have a right to 
vote, but I do think they should be given skilled jobs.”” Which per- 
son has the more favorable expression of attitude toward Negroes? 
Finding these two kinds of people implies that the area is not scal- 
able. As a consequence, no index score based on arbitrary weights 
will provide a meaningful ranking of attitudes of the population. 
As more and more items are added to these two on voting and jobs, 
and if the additional items continue to depart from a scale pattern 
so that many nonscale types of persons are found, any index score 
will have less and less content meaning; it will be less and less able 
to reproduce the responses to the items.* 

2. No criterion for weights. The lack of meaning of “equality” or 
“more favorable” and “less favorable” attitudes becomes even more 
evident when it is recalled that the weights in obtaining an index 
are arbitrary, which leads to a second basic descriptive defect of 
such an index for a nonscalable area. By changing the weights, 
persons who are “equal” according to one set will no longer be 
“equal” according to the other. Persons who were “higher” than 
others according to one set of weights may turn out to be “lower” 


* A good example of the distinction between arbitrary scoring and scale analysis comes 
from some unpublished data of Paul Wallin. He was studying adjustment in engage- 
ment. The items in the questionnaire were all judged to be homogeneous in general 
content, each item expressing some relevant aspect of adjustment in engagement. Yet 
scale analysis showed that the items should be divided into two subareas of content, one 
of which was a scale and the other a quasi scale. It was most striking to find that the 
scores on these two subareas correlated zero with each other! This proves that it is 
rather meaningleas to speak of general adjustment in engagement as a single variable, 
which an arbitrary scoring of the original questionnaire would have implied ; an arbi- 
trary index based on the combined areas would represent neither area. 
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on the other set. Which set of weights are the ones to be used? 
Shall the “right to vote” obtain a higher or lower weight than the 
“right to a skilled job” in attitude toward Negroes? There is an 
infinite variety of weights which can be chosen arbitrarily for an 
index, each of which may seem “reasonable.” If the area is not a 
scale, index scores obtained from different weights can have rela- 
tively low correlations among themselves. This is especially true 
where two subareas correlate zero with each other. What should 
the research worker do when confronted with several sets of weights 
in such a case? Without a frame of reference, such as provided by 
scale analysis, there can be no solution to the problem of what is 
best to do. 

3. Sampling of items and description. A third defect to omitting 
a scale analysis before obtaining descriptive ranks or scores for 
people arises from the problem of selection of items. Just as index 
scores can change in nonscalable areas when different arbitrary 
weights are used, in the saine way they can change if different items 
from a nonscalable area are used. If fifty items are selected from 
the area, then an index score on the first twenty-five can correlate 
very slightly with an index score on the second twenty-five if the 
area is not a scale, and the index score on the whole fifty can differ 
widely from the index scores that would have been obtained if a dif- 
ferent fifty questions had been used. This is especially true in the 
case of public opinion questions which consist usually of a series of 
less than ten questions. The correlation between indexes can be 
zero. There is no rationale for defining what is meant by ideal 
scores on the indefinitely large universe of items from which the 
sample of items were taken in such a case. 

One of the important features of scales is that this problem of 
sampling of items is minimized. If an area is a scale, then any 
sample of items from the area will yield essentially the same rank 
order for the people as any other set of items. Respondents to a 
public opinion poll will be ranked the same regardless of the specific 
questions asked. There is an ideal rank order defined for the infi- 
nite universe of items, and any sample of items from that universe 
will yield ranks that closely approximate the universe ideal. Know- 
ing that an area is a scale means that there need be no great concern 
about the sampling of items; but knowing the area is not a scale 
means that there must be great concern about the sampling of items. 
Remaining in ignorance as to the scalability of an area means re- 
maining in ignorance of whether or not to be concerned about the 
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sampling of items, and leaves the description open to serious criti- 
cism. 

4. Improper weights for prediction. The problem of sampling of 
items and the problem of weights are both felt with full force simul- 
taneously when it comes to problems of prediction. A fourth im- 
portant defect of the suggestion that a scale analysis is unnecessary 
concerns the use of the items in the role of predictors. If it is de- 
sired to predict an outside variable from a given set of items, then 
the best technique for doing this takes into account all the joint 
relationships within the items and between the items and the thing 
to be predicted. The best possible prediction, based on this com- 
plete information, is called multiple correlation or the discriminant 
function technique, depending on whether the thing to be predicted 
is quantitative or qualitative.‘ Let us restrict our discussion here 
to the linear case where the best prediction can be obtained by 
weighting the items and obtaining simple sums as prediction scores. 
If the set of items does not belong to a scale, then a new set of weights 
must in general be used to predict each different outside variable. The 
items must be weighted one way to predict one criterion and an- 
other way to predict another criterion if the best prediction in each 
case is to be obtained. The use of arbitrary weights to obtain an 
index score necessarily means, then, that the predictive power of 
items is not being realized. An arbitrary index in general will 
underestimate the predictability of any criterion from the items. 
The index can correlate practically zero with a given criterion, 
whereas, if the actual multiple correlation or discriminant function 
were worked out, the items could be found to correlate very highly 
with the criterion. 

Suppose a set of items on the role the United States should play 
in international relations proved not to be scalable. Suppose fur- 
ther that arbitrary weights were assigned to the items anyhow, and 
the respondents were scored according to the resulting arbitrary 
index. And suppose further that this index was found to have a 
correlation coefficient of .3 with, say, attitude toward England, 
where the latter attitude was scalable. It would be quite fallacious 
to say that attitude toward England is predictable from attitude 
toward international relations only to the extent permitted by a 


‘For a brief outline of multiple correlation and the discriminant function, see Louis 
Guttman, “An Outline of the Statistical Theory of Prediction,’’ in P. Horst, et al, 
The Prediction of Personal Adjustment (Social Science Research Council, New York, 
1941), pp. 253-312. 
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correlation of .3. The arbitrary weights used for the index are al- 
most certainly not the best for the purpose of predicting attitude 
toward England. If the best weights for this particular prediction 
purpose were computed by the technique of multiple correlation, the 
coefficient of multiple correlation could be much higher than .3. 
Instead of being negligibly predictable, attitude toward England 
might be highly predictable from the items in the area on inter- 
national relations. The arbitrary index scores can be completely 
misleading concerning the extent of this predictability. 

In the case of a scalable area, the multiple correlation or discrimi- 
nant function must always be given very approximately by the scale 
scores, regardless of what is being predicted. If a correlation with the 
scale scores is .3, then the multiple correlation coefficient with the 
separate items is necessarily very close to .3. Whether predict- 
ability is high or low will, of course, depend on what is being pre- 
dicted from the items; but regardless of what is being predicted, 
maximum predictability from the items is given by the scale scores. 
Knowing that the set of items is scalable means that it is known that 
the full predictive power of the items is being used in any situation. 
Knowing that the items are not from a scale means that it is known 
that the items must in general be reweighted for each new prediction 
problem. Not knowing whether or not the area is a scale, but just 
using arbitrary weights to obtain index scores means knowing that 
in many cases much of the predictive power of the items is being 
thrown away by the index. 

5. Sampling of items and prediction. A fifth drawback to not 
finding out the scalability or reproducibility of an area concerns the 
relationship of multiple correlation to the sampling of items. If the 
area is scalable, then it is known that not only the multiple correla- 
tion of the thing being predicted with the set of items is essentially 
the correlation of that thing with the scale scores, but also it is 
known that adding more items to the sample from the same scalable 
universe will noé increase the multiple correlation. That is why 
relatively few items can be used from a scalable area for prediction 
purposes, yet the predictive power of the infinite number of items 
from which this sample was drawn is being fully realized. For ex- 
ample, only five or six items from each of fifteen areas were used in a 
study of psychoneurotics in the Army. Using five or six hundred 
items in each of these areas instead of just the five or six would not 
have increased the multiple correlation. 
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This feature of scale analysis is of utmost importance in public 
opinion polling, where the problem of prediction from as small a 
number of questions as possible is of major practical importance. 

On the other hand, if an area is not a scale, then adding more items 
will in general increase the multiple correlation. There is no way 
of knowing when the maximum correlation has been reached. The 
possibility remains that more items in the same area can still help 
in a given prediction problem. There is as yet no way of deter- 
mining what the multiple correlation would be on the infinite uni- 
verse of items from only a sample of items if the area is not a scale 
(or a quasi scale). That means that one cannot speak of the pre- 
dictability of an outside variable from the items except as an under- 
statement. The multiple correlation on the universe of items is 
certainly as big as that obtained in the sample but may be much 
bigger. A sample of items may yield just a small multiple corre- 
lation whereas in fact the universe may yield a very high correlation. 

Knowing that an area is scalable means that the problem of sam- 
pling of items for prediction purposes can be simply handled; the 
maximum predictability in the area can be determined just by 
means of a sample of items. For nonscalable universes, there is the 
ever-present possibility that the predictive power of the whole area 
is far greater than the predictive power of the sample of items used; 
and multiple correlation based on a sample of items from a nonscal- 
able universe can be very misleading. It follows that a correlation 
based on an arbitrary index derived from the sample will be even 
more misleading, because this correlation will underestimate in turn 
even the sample multiple correlation. 

The successive shrinkage in predictability due to the sampling of 
items and to the use of an arbitrary index for nonscalable areas can 
be visualized as follows: 


Correlation 
with arbi- 
trary index 
of sample 
of items 


Multiple correlation 


Multiple correlation on a eample of it 


on a nonscalable > 


* : from the nonscalable 
universe of items 


universe 


The retention of the full predictive power of a scalable universe by 
a sample of questions, on the other hand, can be visualized as fol- 
lows: 
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Item Analysis 


Similarity of purpose. The purpose of scale analysis is to test the 
hypothesis that a universe of items comprises but a single factor in 
the sense that from but a single set of scores (on this factor) the re- 
sponses to each of the items can be reproduced. The widely used 
technique of item analysis often has a similar descriptive motivation 
in that it is desired to test whether or not a set of items is related to 
but a single factor. However, important distinctions exist between 
the scaling and the item analysis approaches in at least two basic 
respects: one is the manner in which the objective is defined, and 
the other is the resulting difference in techniques. 

If both approaches are regarded as being directed toward the 
same problem of describing internal consistency, then it will be 
found in the following discussion that at times item analysis may 
entirely miss the mark. This may seem like a radical criticism of 
item analysis in view of the fact that the technique is so widely used 
in the literature. The criticism will not seem so radical,’ however, 
when item analysis is viewed in its historical perspective. Item 
analysis, with respect to attitudes, seems to have originated in a 
different context from that of exploring the structure of a set of 
items. It was in the field of predicting an outside variable that item 
analysis was initiated and it is in this field that it is actually helpful 
and proper to a large extent. When the newer field of attitude 
analysis was developed, item analysis was borrowed rather uncriti- 
cally in order to describe the internal consistency of attitude items. 

The present theory provides a coherent approach to scale analysis 
from the point of view of internal consistency. We will attempt to 
show that the borrowing of the item analysis technique is no longer 
advisable. 

Proper use for a single prediction problem. Item analysis is often 
useful in a situation where it is desired to use the items as predictors 
of a given outside variable. The best way to accomplish this pre- 


* This criticism is actually not original here. It was made many years ago in the 
mental testing field, where much more attention has been given to the rationale of 
analyses than in the attitude testing field. 
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diction is by means of multiple correlation or the discriminant func- 
tion. But to do the complete multivariate analysis required by 
these techniques is a laborious, time-consuming, and often prohibi- 
tive undertaking when many items are involved. Item analysis 
has been shown empirically to be able to provide a short cut in se- 
lecting a relatively small subset of items that will predict the cri- 
terion with substantial accuracy even though it is known that pre- 
dictive efficiency from the whole composite is lost thereby. Item 
analysis is carried out in practice by various techniques; but they 
all consist of relating each item individually to the external variable, 
and then selecting and weighting each item according to its separate 
correlation. The prediction variable arrived at by means of item 
analysis will not be as good in general as the one arrived at by the 
complete multivariate analysis required by multiple correlation or 
the discriminant function, but it may sometimes be adequate for 
the investigator’s purpose. It must be remembered, though, that 
if the same items are to be used to predict some other criterion in a 
new problem, then a completely new item analysis must be per- 
formed on all the original items. A new selection and weighting 
must be made in general for each new prediction problem. 
Improper use for studying internal consistency. The use of item 
analysis for saving work in a prediction study can be quite proper 
and helpful. It is quite another matter, however, to use item anal- 
ysis as a technique for studying the internal consistency of a set of 
items. The problem of describing internal consistency can eventu- 
ally tie in with the problem of prediction, but it must do so without 
reference to any particular outside criterion. A scalable area, for 
example, because of its pattern of consistency, can be used in a very 
simple fashion to predict any outside variable whatsoever; the simple 
scale scores are invariant quantifications of the items for predicting 
any external criterion. But the problem of describing internal con- 
sistency itself must not be confused with the problem of external 
predictions. According to scale theory, the problem of internal 
consistency is stated to be: Can each item inside the area be repro- 
duced from scale scores? The amount of reproducibility is com- 
puted directly by a simple counting up of errors; this is a proper 


¢ Throughout this paragraph, it is assumed that a large sample of persons is used for 
the computations. If a relatively small sample of persons is used, item analysis may 
be better than multiple correlation when it comes to predictions for a new sample of per- 
sons. This has been shown theoretically in Louis Guttman, zbid., and it has been shown 
empirically in Louis Guttman, “Two Studies in Weighting Techniques,” op. cit., pp. 
349-364. 
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way to measure the correlation of a qualitative variable with any 
other variable. 

Item analysis, as employed now for the study of internal con- 
sistency of a set of items, does not seem to have such a clear ra- 
tionale in mind. The reasoning seems to be, instead, somewhat as 
follows. The items are all thought to be indicators or measure- 
ments of some central variable. The problem is to obtain persons’ 
scores on this variable. If this central variable could be observed 
by some external means, then of course a good way of estimating 
persons’ scores through the items would be to work out the multiple 
correlation of this variable on the items; and in order to save the 
labor of multiple correlation, item analysis might be employed to 
select and score the items to obtain total estimates of the central 
variable. If there really were such a variable which could be de- 
fined apart from the items, this statement of the problem resolves 
again into prediction of an outside variable. But unfortunately, 
research workers do not observe a central variable external to the 
items; and now reasoning seems to be of the following nature. Since 
there is no external variable with which to perform an item analysis, 
why not obtain a variable from the items themselves? This can 
then be used in lieu of an external variable, and an item analysis can 
then be performed. 

To obtain a variable from the items themselves as a starting point, 
arbitrary weights are assigned to the items, and a total score is ob- 
tained by summing the weights for each person. These total scores 
are used as though they were external to the items, and then the 
relationship of the total score to each item is examined by any of 
several item analysis techniques. 

It would, of course, be foolish to work out the complete multiple 
correlation of the total score on the items, since it is known in ad- 
vance that it must be perfect; the total score is a perfect function of 
the items, being an exact sum of them. What sense, then, does 
item analysis have in this context of internal consistency? It can- 
not be regarded as an approximation to multiple correlation because 
there is no need for such an approximation in this case where the 
multiple corre'ation is known (to be perfect). Does the item anal- 
ysis accurately describe the internal consistency of the data then? 
The answer again must be in the negative, if by internal consistency 
is meant the reproducibility of the items from the total score.’ 


7'This has also been recognized earlier in the field of mental testing. In criticizing 
the use of various critical ratios (which are typical techniques in attitude item analysis) 
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Item analysis does not attempt to see how well items can be repro- 
duced from the total score, but rather it attempts to do just the op- 
posite; item analysis investigates how well the total score can be 
estimated from each item separately. This is a crucial distinction. 
Scale analysis regards the score as a representation of the items; 
the score is to be a means of representing the items in any situation, 
which will be possible if each of the items is perfectly related to the 
score. The converse is done by item analysis; here, the item is sup- 
posed to represent the score. 

Being reproduced versus discriminating. It can easily be seen that 
even though the items form a scale, so that each item is a perfect 
function of the score, nevertheless the score can be negligibly corre- 
lated with any one item. For example, consider a dichotomous 
question from a scalable universe in which 60 per cent of the people 
said “Yes” and 40 per cent said “No,” and where ‘‘Yes”’ is more 
favorable than ‘‘No.”’ Then it must be that everybody in the top 
60 per cent of rank order on the entire scale said ‘“‘Yes’’ to the ques- 
tion and everybody in the bottom 40 per cent said “‘No.”’ Each 
person at a given rank has the same response to the question. The 
converse is not true. To each response to the question correspond 
many ranks. If a person said “‘Yes” to the question, we cannot 
know his rank very closely from this information alone; it can be 
anything between the fortieth and one hundredth percentile. If 
he said “No” his rank could be anything between the zero and 
fortieth percentile. This is not a very high correlation of the rank 
with the response, whereas the correlation of the response with the 
rank is perfect. Item analysis would discover that the mean rank 
of those who said ‘“‘Yes’’ is different from the mean rank of those 
who said “No” (or some equivalent relationship if ranks are not 
used). Finding such a mean difference says litile about the reproduci- 
bility of the item from the rank. A large mean difference between 
ranks can be found for nonscalable items, as well as for scalable 
items. Testing to see whether an item will discriminate between 
total scores has little to do with testing to see whether an item can 
be reproduced from a total score. Item analysis will find that items 
“discriminate’’ regardless of scalability. All that is required is sig- 


in intelligence item analysis, Guilford states that “the amount of overlapping of high 
and low groups . . . gives a truer picture of discriminatory power.” J. P. Guilford, 
Psychometric Methods (McGraw-Hill Book Co., Inc., New York, 1936), p. 434. He 
uses the word “validity” in this context where we would use the word “scalability.” 
Studying the overlapping of high and low groups is tantamount to studying reproduci- 
bility 
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nificant correlation ratios of scores on items, which says nothing at 
all about reproduction of items from scores. 

It seems clear, then, that if it is intended to study the internal 
structure of a set of items, item analysis is quite beside the point. 
Item analysis should be used only in a situation involving an ex- 
ternal criterion, where the items are intended to be only aids in this 
one prediction problem. It is misplaced when used in connection 
with internal analyses. 


Equal-Appearing Intervals and ‘Cafeteria’ Questions 


The procedure of equal-appearing intervals. One of the most widely 
used approaches at present for the descriptive analysis of attitude 
items is the method of equal-appearing intervals as developed largely 
by Thurstone.* This approach differs considerably in its rationale 
from that of scale analysis. Indeed, the two theories can be con- 
sidered as each being devoted to a distinct and separate problem, 
and each is appropriate for its own purpose. 

Briefly, the procedure for equal-appearing intervals is to have a 
group of judges rank a set of statements in the way they think will 
best fit the population of individuals to be studied. The items are 
always in a dichotomous form. A declarative statement like ‘“The 
British are fine people” is made, and the response of each subject 
to whom the questionnaire will eventually be administered is to be 
either an endorsement or lack of endorsement of such a proposition. 
The task assigned the judges is to determine weights for the en- 
dorsements of a series of such statements. This is usually done by 
having the judges rank the statements into eleven piles. If a judge 
believes that endorsing a statement means a very favorable response, 
then he will place it in a pile with a high rank. If he believes an 
endorsement means a very unfavorable response, he will place it in 
a pile with a low rank. 

This ranking into piles is done by a group of judges. The con- 
sistency of the ranking of the judges is then analyzed from two 
points of view. One is to reject items which are deemed to be 
ambiguous or otherwise faulty because of wide disagreements in the 
rankings of the judges. The other is to assign weights to the state- 
ments. The weight given to a particular statement is usually the 
median of the rankings assigned to it by the judges. 

The judges assign the weights, not according to how they them- 


*See L. L. Thurstone and E. J. Chave, The Measurement of Attitude (University of 
Chicago Press, Chicago, 1929). 
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selves would respond to the questions, but rather in terms of how 
strong an opinion or attitude they believe endorsement of the state- 
ments would imply for the subjects who are to take the question- 
naire. The selection of items and weights, therefore, is intended to 
reflect the judgments of the judges as to the role that endorsement 
of each proposition plays in the attitude of the subjects. 

The final selection of items is then administered to a sample from 
a population of individuals. The statements that each individual 
endorses have their weights added up, and the average represents 
the individual’s score on the attitude represented by the items. A 
person with a higher score is said to have a higher attitude, and a 
person with a lower score is said to have a lower attitude. 

Difference in rationale. In so far as the consistency of judgments 
of the judges is being studied, the problem of equal-appearing inter- 
vals is quite different from that of scale analysis. In scale analysis, 
there is no concern with the behavior of judges. The entire con- 
cern is with the behavior of the ultimate respondents. It is the 
consistency of the respondents that is studied by scale analysis and 
not the consistency of judges’ judgments. It has been found in at 
least several studies that there is no necessary relationship between 
the weights assigned by judges and the pattern of behavior of the 
respondents.® From a person’s score obtained by the method of 
equal-appearing intervals, one cannot in general reproduce his re- 
sponse to each of the questions. Two people with the same score 
can have quite different patterns of responses. In other words, 
selection and weighting of items according to equal-appearing inter- 
vals does not at all necessarily yield a scale in the sense of reproduci- 
bility from scale scores. 

Difference in utility. Asa corollary to the fact that treating items 
by the method of equal-appearing intervals does not necessarily 
yield a scale of respondents’ behavior, all the problems concerning 
nonscalable areas also can attend these items. If it is desired to 
predict an outside variable from the items, then using the weights 
given by judges will in general underestimate the correlation that 
would have been obtained if the specific multiple correlation weights 
were used instead. The scores obtained from judges’ weights, in 
so far as they reflect the behavior of the judges and not of the re- 
spondents, do not represent the responses of the respondents and 


* See, for te references in Hugh Carter, ‘““Recent American Studies in Attitudes 
Toward Wer: A Summary and Evaluation,” American Sociological Review, Vol. 10, 
No. 3 (June 1945), pp. 343-352. 
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hence are not an invariant quantification of the respondents’ an- 
swers for external purposes. Predictive power of the items will in 
general be lost by the use of judges’ weights. 

The method of equal-appearing intervals is helpful in studying 
the judgments of people in comparing various things. It can be 
regarded as a variation on paired comparisons (which is the next 
technique to be discussed below). It does not seem very appro- 
priate for studying the internal consistency of the actual responses 
of respondents. Consistency of judges and consistency of re- 
spondents are two distinct and separate problems. 

“Cafeteria” questions. The lack of correspondence between judg- 
ments of responses and scalability of responses can also be found 
in a technique related to that of equal-appearing intervals. A com- 
mon device used in opinion polls is the “‘cafeteria” type of question. 
Here, several statements are prepared which are often judged to 
represent different degrees of favorableness on an issue, and the 
respondent is asked to indicate which statement comes closest to 
expressing his opinion. For example, one such question used re- 
cently in national public opinion polls is the following: 


With which one of these statements concerning postwar relations with 
Russia do you come closest to agreeing? 

1. It is very important to keep on friendly terms with Russia, and we 
should make every possible effort to do so 

2. It is important for the U.S. to be on friendly terms with Russia, but 
not so important that we should make too many concessions to her 

3. If Russia wants to keep on friendly terms with us, we shouldn’t dis- 
courage her, but there is no reason why we should make any special effort 
to be friendly 

4. We shall be better off if we have just as little as possible to do with 
Russia 


The polling agencies using this question apparently judged that en- 
dorsing the first statement implied a more favorable attitude than 
endorsing the second, etc. The ‘‘cafeteria’”’ question differs from 
the more complete treatment by equal-appearing intervals in that 
only a rank order of categories is attempted, and not a more precise 
metric.” 

In order to test the hypothesis that the four statements actually 
indicate degrees of favorableness, the Research Branch asked the 

* In a study reported by Cantril, judges were used to rate the various statements in a 
“cafeteria” question with the purpose of securing measurable distances between the 


different alternatives. See Hadley Cantril, “The Intensity of an Attitude,” Journal of 
Abnormal and Social Psychology, ‘Vol 41, No. 2 (April 1946), pp. 129-135. 
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question on Russia of a sample of 3,000 enlisted men in the following 
form: 


Do you agree or disagree with this statement? 

It is very important to keep on friendly terms with Russia, and we should 
make every possible effort to do so. 

Agree 

—__— Disagree 

Undecided 


Similarly, each man was asked if he agreed or disagreed with each 
of the remaining three statements in the origmal ‘‘cafeteria” ques- 
tion. Thus the original question was broken up into four questions 
which could then be tested for scalability. A scalogram analysis 
showed that these four questions did not form a scale. This would 
indicate that more than one dimension was involved in the cafeteria 
answer alternatives and consequently the question could not be 
used to rank respondents according to degree of favorableness 
toward Russia. 

A further comparison was made by asking the ‘‘cafeteria’” ques- 
tion as originally worded following the rewording into four ‘“‘agree- 
disagree” questions. Of all men who checked statement 4 (the 
most extremely anti-Russian statement) when asked in the cafeteria 
form, 49 per cent also “agreed’”’ with statement 1 (the most ex- 
tremely pro-Russian statement) when presented as a separate ques- 
tion. Again, of all men who checked statement 4 in the cafeteria 
form, more (77 per cent) “‘agree’”’ with statement 2 than with state- 
ment 3 (59 per cent) when presented as separate questions, which 
would indicate that statement 3 is less extreme than statement 2. 
These results confirm what was proved by the scale analysis, namely, 
that more than one dimension of response is being tapped by the 
cafeteria question. 

Judgment of the dimensionality of a ‘cafeteria’ question with 
respect to the actual responses seems to have the same basic weak- 
ness as judgments in the more complete equal-appearing intervals 
technique. A study of the interrelationships of the responses them- 
selves seems essential to determine whether or not a meaningful 
rank order is present. 

Since a “cafeteria” question obtains but one response from each 
person, it permits for no check on dimensionality. It seems ad- 
visable in general, then, to ask a series of ‘‘agree-disagree” questions 
(or some similar form), rather than to combine them into a single 
question. This provides several answers for each respondent, so 
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that a scale analysis can then be performed to test the hypothesis 
that but a single dimension is present. 


Paired Comparisons 


Difference in purpose. The purpose of scalogram analysis is to 
find out whether or not the attitude of people toward one thing (one 
universe of items) can be represented by a single rank ordering of 
people. Paired comparisons has a related but different purpose; it 
attempts to determine average differences between the population’s 
attitudes toward several things. Scale analysis tests the hypothesis 
that it is meaningful to say that one person has a higher attitude 
than another toward some one object. Paired comparisons tries to 
find out if the average attitude of a population toward one object is 
higher than its average attitude toward another object. Hence the 
goals of scale analysis and of paired comparisons are quite distinct. 

An important use of paired comparisons made by the Research 
Branch, for example, was with respect to the point system for de- 
mobilization. The problem was to assign weights to the four fac- 
tors considered important by the soldiers with respect to determin- 
ing who should be released from the Army first: length of time in the 
Army, length of time overseas, amount of combat, and number of 
children. Should a battle receive less weight or more weight than 
a baby? Should a month of service overseas count the same as a 
month of service in continental United States? These were the 
kinds of questions to be resolved in the light of the attitudes of the 
enlisted men. 

No single set of weights for these categories would be able to 
reproduce each man’s judgments, because people would be found 
who would give opposing judgments. Men in continental United 
States would tend to discount overseas service and combat; men 
overseas would tend to discount anything involving United States 
duty. Family men would tend to emphasize children; nonfamily 
combat men would tend to emphasize battle credit and discount 
children. 

If we wished, we could make a separate study of each of the four 
factors. A series of questions could be asked concerning, say, chil- 
dren alone. A scale analysis would test the hypothesis that atti- 
tude toward children, with respect to demobilization, was a scale. 
Similarly, attitude toward each of the remaining three factors could 


1 See Louis Guttman, “An Approach to Quantifying Paired Comparisons and Rank 
Order,” Annals of Mathematical Statistics, Vol. 17 (1946), pp. 144-163. 
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be tested separately for scalability. Studying attitudes toward 
each of the separate factors would be a scale problem, so that four 
distinct scale analyses could have been made. 

Paired comparisons, however, is not directly concerned with how 
people are ranked on each of the separate factors. It is not con- 
cerned with saying that one person has a more favorable attitude 
than another with respect to credit for children. Instead, it is inter- 
ested in an average comparison of the attitudes on the factors; e.g., 
on the average, are men more favorable toward credit for children 
than toward credit for battle? 

No single factor, or no single system of weights, will account for 
all the judgments of the men on the four demobilization factors. 
The problem of paired comparisons is usually a multifactor problem 
with respect to individual responses. People offer different systems 
of judgments according to their different attitudes toward each of 
the things being judged. 

If each of the four attitudes in the score card problem were scal- 
able, and if these four scales were not perfectly intercorrelated 
(which undoubtedly would be the case for these data), then it would 
be necessary to give each man four scores in order to reproduce his 
judgments, one score on each attitude. The final weights assigned 
by paired comparisons to an object would represent an average of 
the scores of the population on that object. If one object receives 
a higher weight than another, that means that on the average the 
population’s attitude toward the first object was higher than on the 
second. 

Paired comparisons inquires directly into the average differences 
between attitudes toward the things being compared, without any 
analysis of the internal structure of the separate attitudes (which is 
the problem of scale analysis). An average difference, like any 
other average, must be considered in the light of the dispersion about 
the average. The less correlated the attitudes are, the more dis- 
persion there will be in the comparisons, and the less reproducible 
are the judgments of each person from knowledge of only the group 
averages. The system of weights finally adopted for the score card 
is approximately that which would best help reproduce the set of 
judgments of each of the individual soldiers. It will contradict 
many of the judgments, but on the whole it is the best fitting single 
system of weights. Any different set of weights will contradict 
even more judgments than the weights used. 

From paired comparisons, then, one attempts to reproduce the 
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judgments of the people from the weights of the objects. In contra- 
distinction, scale analysis assigns each person a rank and from that 
reproduces his responses. 

A perfect scale would have one and only one factor, so that repro- 
ducibility would be perfect. Paired comparisons is usually not 
a single factor problem; all people rarely rank the objects in the 
same way, because their attitudes toward the things being compared 
are generally not perfectly intercorrelated. At best, paired com- 
parisons can reproduce the proportion of people who say one object 
should be higher than the other. The method of paired compari- 
sons, a8 developed by Thurstone and others," focuses entirely on 
reproducing proportions of judgments. It introduces an assump- 
tion of a normal distribution for the attitudes, and then proceeds to 
see if, by using this assumption, weights can be devised for the ob- 
jects being judged which will reproduce the table of proportions of 
judgments of each kind. The newer approach to paired compari- 
sons, as used in the score card problem, does not make any assump- 
tions about normal distributions or any other kind of distributions, 
but focuses rather on reproducing each individual’s judgment, not 
just the group proportions of judgments. In either case, it is the 
objects whose ranking is analyzed (in terms of average attitudes), 
and not the people. In scale analysis, it is the people who are 
ranked and not the objects. In paired comparisons, the weights for 
objects are averages which usually cannot reproduce individual 
judgments very closely. In a scale, from a person’s rank one can 
reproduce his responses to each item almost perfectly. 


Factor Analysis 


Factor analysis designed only for quantitative variables. In the 
field of mental testing, there has been developed over a number of 
years an approach to analyzing the structure of a system of many 
quantitative variables. Origmated as a single factor theory by 
Spearman, and developed into a multifactor theory by Thurstone 
and others, this approach is now being widely used in analyzing 
intercorrelations of many kinds of sets of variables apart from 
mental tests. If we are concerned with a universe of qualitative 
items, i.e., a large set of qualitative variables, could not factor anal- 
ysis serve as well to analyze the structure of this universe? What 
relationship would there be between results obtained from a Spear- 


13 For a discussion of the work of psychologists on paired comparisons see J. P. Guil- 
ford, op. cit., especially Chapter 7. 
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man-Thurstone factor analysis of a set of qualitative items and 
results obtained by a scale analysis of those items? 

Briefly, the answer to this question can be stated as follows: a 
factor analysis in the Spearman-Thurstone sense will fail to test 
adequately the scalability of qualitative data, because it was not 
designed for that problem. Scale analysis is designed as a single- 
factor theory for qualitative data. From a scale analysis it can be 
known what a factor analysis will show. The converse is not true; 
from a factor analysis it will usually be difficult, if not impossible, 
to know what a scale analysis will show. Scale analysis, as outlmed 
previously, makes a complete analysis of qualitative data, using no 
extraneous assumptions, and using only techniques appropriate to 
qualitative data. A factor analysis in the sense of Spearman and 
Thurstone was not designed for qualitative data and will not make 
a complete analysis of qualitative data; it will ordinarily use ex- 
traneous and often misleading assumptions, will not use techniques 
appropriate for qualitative data, and can lead to quite erroneous 
interpretations. 

The need for distinguishing qualitative from quantitative variables. 
Factor analysis in the Spearman-Thurstone sense was devised for 
the study of quantitative variables. When variables are quantitative 
they can often have linear regressions, in fact can have normal dis- 
tributions, and can have their interrelations measured by product- 
moment correlation coefficients. It is well known that in a normal 
multivariate distribution, all multiple and partial correlations be- 
tween any subsets of variables can be completely expressed in terms 
of zero-order correlations. Therefore, in studying quantitative vari- 
ables, especially those which have a normal distribution, it is suffi- 
cient usually to consider only the correlations between the variables 
two at atime. That is why, in factor analysis, only the matrix of 
zero-order correlations is analyzed. Higher order relationships are 
implicitly analyzed if the zero-order structure is known. If ra, Tss, 
and r,, are known, then immediately the following correlations are 
also known: Pey.s) Tzs.yy Tys.z) Tz.ys; Ty.xs) AD 7,2, Similarly, for the 
case of more than three variables, all partials and multiples are 
known from just the zero-order correlation coefficients. 

When it comes to qualitative data, two of the basic features are 
lacking which can be present in the quantitative case. There are, 
in general, no simple analytical expressions like linear equations and 
their associated product-moment correlation coefficients for ex- 
pressing relationships between items. Secondly, from zero-order 
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relationships, one cannot in general deduce higher order relation- 
ships very closely. The problems of prediction and correlation for 
qualitative data are quite different from that of quantitative data.¥ 

The distinction between a variable and its frequency function. Con- 
fusion may arise as to what is a quantitative variable because when 
one is dealing with a statistical variable, one is always dealing with 
not one variable but with two variables simultaneously. 

One variable is the content of interest; it is called variate, statis- 
tical variable, random variable, or stochastic variable. The other vari- 
able is the frequency function. The frequency function is always 
quantitative as it arises from a counting process. The variate itself 
may be either quantitative or qualitative; the fact that the fre- 
quency function is quantitative has nothing to do with the nature 
of the variate. Just because males and females can be counted does 
not imply that the variable sex is quantitative. It is this confusion 
between a variate and its frequency function that leads to improper 
phrases concerning “quantifying data.” Votes for president are 
not put in a quantitative form when frequencies are recorded. Vot- 
ing remains a qualitative act, even though it possesses a frequency 
function for a population. 

It is undoubtedly such attempts to speak of a variate and its fre- 
quency function both in the same breath that has led to many verbal 
misunderstandings. (The presence from the very beginning of two 
variables is also the cause for difficulty in learning the logic of un- 
certain inference. The simplest problem in this field must involve 
at least two variables.) 

The habit of least squares. Another of the difficulties in dealing 
with qualitative variates is a prevalent belief that qualitative classi- 
fications are in some sense “inferior” to quantitative classifications. 
This kind of belief will be found expressed even in the most recent 
textbooks. Attention has been paid, therefore, almost exclusively 
to the analysis of quantitative variates. Most of the work in ele- 
mentary statistics can be said to be devoted to elementary aspects 
of least squares. The arithmetic mean is used extensively since it 
minimizes squares of deviations; the standard deviation is used to 

43 In his classical textbook, An Introduction to the Theory of Statistics, Yule devoted 
the first five chapters to studying the multivariate distribution of qualitative variables. 
It may be regarded as quite strange that statistical textbooks in the social sciences, 
where qualitative data are so important, have not followed suit, but instead fail to dis- 
cuss material of this kind at all. A study of qualitative data as qualitative data could 
do much to help avoid the pitfalls of mechanically assuming that all data are quanti- 


tative and of mechanically using techniques and formulas developed and appropriate 
for only quantitative variables. 
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express dispersion; correlation indexes are defined in terms of ratios 
of variances; and so on. When qualitative variates do occur, they 
are examined ordinarily only from the point of view of how well 
they help predict quantitative variables, so that the focus again is 
on error in quantitative variates, measured in terms of squares of 
deviations. This is exemplified by the correlation ratio and the 
analysis of variance (including the ¢-test and related developments). 
Very little rigorous work has been done on the reverse problem of 
predicting qualitative variates. 

The history of the prediction of qualitative variates, especially by 
sociologists, psychologists, and educational psychologists, reflects a 
habit of trying to cast everything into least squares, thereby losing 
sight of the data being dealt with. It has not been clearly recog- 
nized that the definition of deviation or error for qualitative variates 
must rest on its own feet and not be a mere analogy. 

It has been pointed out elsewhere that a principle appropriate 
for predicting qualitative variates is that of maximum probability, 
namely, minimizing the number of deviations. We cannot speak 
of how far off one qualitative value is from another when there is no 
metric to begin with, but we can always speak of the number of 
times we are off. 

The habit of least squares must be abandoned if straightforward 
analysis of qualitative variates is to be made. Least squares may 
be useful as a mechanical tool later in the analysis—indeed some 
equations of scale analysis discussed in later chapters emerge from 
certain least squares considerations—but it is not the starting point 
nor guiding principle. 

Since qualitative variates, like responses to attitude questions, 
are by definition not numerical, any technique which involves add- 
ing numbers does not apply to the variate values. Qualitative 
variates do not have arithmetic means, standard deviations, nor 
product-moment correlation coefficients. Qualitative variates do 
have frequency distributions, but the distributions cannot be anal- 
yzed in terms of most of the statistics used for quantitative variates. 
A qualitative variate does have a mode, namely, the variate value 
with the highest frequency. If a qualitative variable is to be pre- 
dicted, the value used would be the modal value, because it mini- 
mizes the number of errors of prediction.“ A prediction is either 

4 Louis Guttman, loc. cit., especially pp. 258-263, 271-275. For an adaptation of 
this material, see J. P. Guilford, Fundamental Statistics in Psychology and Education 


(McGraw-Hill Book Co., Inc., New York, 1942), especially Chapter 10, pp. 176-197. 
% See Guttman, ibid., or Guilford, ibid, 
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right or wrong, so that predictability is measured simply by count- 
ing up the number of right and the number of wrong predictions. 
In quantitative variables, there is a numerical meaning to the size 
of error, which therefore permits computing things like averages and 
standard deviations; this is not the case for qualitative variables. 

The danger of tetrachoric correlation coefficients. A widely used 
practice with respect to relating two dichotomies is the use of tetra- 
choric correlation coefficients. This is a very convenient method 
of estimating the product-moment correlation coefficient between 
two quantitative variates which are known to have a normal bi- 
variate distribution. If there were no sampling error of people, the 
tetrachoric coefficient would be precisely equal to the product- 
moment coefficient for the population, and that is the only purpose 
in using the tetrachoric coefficient. It has no meaning apart from 
the product-moment coefficient it estimates. 

If data are qualitative and are not obtained by dichotomizations 
of quantitative variables, the use of tetrachoric coefficients can be 
very dangerous and misleading. It is all too easy to take the posi- 
tion that if a question has two possible responses like “Yes” and 
“No,” then these two responses should be regarded as artificial 
representations of some underlying quantitative variable which has 
a normal distribution. The important task the research worker 
should undertake is to test this assumption, instead of just accepting 
it uncritically. What is the variable of which this dichotomization 
is supposed to be a partition? How would one prove that such a 
variable exists? If the research worker cannot answer this question 
of how he would ever test such a hypothesis about an underlying 
variable, it would certainly not be safe to use the assumption. 

Actually, scale analysis is a technique for testing the assumption 
that such a question is a dichotomy which can be regarded as a 
slicing of a quantitative variable. Instead of making assumptions 
and proceeding from there, scale analysis has as its aim the proof or 
disproof of the hypothesis. The specification of any particular type 
of distribution, like the normal distribution, is completely absent, 
however, from scale analysis. The existence of a single variable un- 
derlying a set of qualitative variables can be established or disproved 
without any reference to the nature of its frequency distribution. 
Only rank order is needed, and rank order does not depend upon 
the shape of the distribution. 

Inconsistencies due to tetrachoric correlation coefficients. To illus- 
trate the importance of testing the hypothesis that there is an under- 
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lying quantitative variable instead of merely accepting it, let us 
consider the following distribution of three dichotomies. Suppose 
we have three questions to which the possible answers are ‘‘Yes”’ 
and “No,” and that for each question “Yes” indicates a “more 
favorable’’ attitude than “No.” Let us suppose that these three 
questions are asked of a population and that only four kinds of 
people are found, with frequencies according to the following table: 


Question 
Type of person 1 2 8 Frequency 
I Yes Yes Yes 10 
II Yes No Yes 30 
III Yes No No 40 
IV No No Yes 20 


100 


Out of 100 people, 10 said ‘‘Yes’’ to all three questions, 30 said 
““Yes’’ to the first and third questions but ‘“No” to the second, etc. 

If tetrachoric coefficients were to be computed between each of 
the possible pairs of these three variables, they will be found to be 
completely contradictory. There are three zero-order correlations 
possible: ris, 7:3 and r23. The table for ry can be obtained from the 
preceding table as follows. The number of people who said ‘“‘Yes”’ 
to both questions 1 and 2 is 10, namely all the people of type I. The 
number who said ‘“‘Yes”’ to question 1 and ‘“‘No”’ to question 2 is 70, 
the frequencies of both types II and III. 20 people, those of type 
IV, said ‘‘No” to both of the first two questions. No people at all 
said ‘‘No”’ to the first question and ‘“‘Yes”’ to the second. The four- 
fold table of frequencies for questions 1 and 2 therefore looks as fol- 
lows: 


Question 1 
Yes No 
Yes 10 0 
Question 2 
No 70 20 


The tetrachoric coefficient for this table is +1, since there is a zero 
cell in the upper righthand corner. If these two dichotomies actu- 
ally are slices of two quantitative variables which have a normal bi- 
variate distribution, then those two quantitative variables must 
have a perfect, positive, product-moment correlation coefficient. 
Let us now look also at the correlation between questions 1 and 3 
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and 2 and 3, respectively. The fourfold tables are computed from 
the trivariate table above and are as follows: 


Question 1 Question 2 
Yes Yes No 


No 
Yes 40 20 Yes 10 50 
Question 3 |__| Question 3 _—_—_—— 
No 40 0 No 0 40 


The tetrachoric correlation between questions 1 and 3 is —1, while 
between 2 and 3 itis +1. But these two correlations are impossible 
if ri, is +1 according to our first fourfold table. If two variables 
are perfectly correlated, the correlation of the first variable with a 
third variable must be exactly the same as the correlation of the 
second variable with the third variable. But here we have the 
paradox that the first two variables are perfectly correlated but 
have completely opposite relations with a third variable. 

What the above example shows is that these three questions can- 
not be slicings of quantitative variables which have bivariate nor- 
mal distributions with each other. The assumptions behind the 
use of the tetrachoric correlation coefficient cannot possibly hold 
here. 

While the example just given is very extreme, it should serve to 
point out the possibility of inconsistencies creeping into analyses if 
tetrachoric correlation coefficients are used without any test being 
made that the assumptions behind them are satisfied by the data. 
More generally, it can be proved that inconsistencies can arise when 
the correlations are more moderate in size than +1 or —1. It is 
well known that the matrix of zero-order product-moment correla- 
tion coefficients between any quantitative variables must be Gram- 
ian. This is one of the basic conditions which make factor analysis 
in the sense of Spearman and Thurstone possible. However, if 
tetrachoric correlation coefficients are computed between dichot- 
omies, then the resulting correlation matrix need not at all be 
Gramian. This is true even though the tetrachoric coefficients 
may be of the size ordinarily found for product-moment correlations 
in factor analysis problems. The reason is that zero-order relation- 
ships in general do not determine higher order relationships, and 
use of the tetrachoric involves only zero-order relationships. 

In studying a multivariate distribution, which is the problem of 
scale analysis, all relationships must be studied if a proper descrip- 
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tion is to be obtained. Scale analysis studies all relationships 
simultaneously, not just zero-order relationships. 

The measurement of error. Apart from algebraic inconsistencies 
that can arise from use of tetrachoric coefficients, perhaps an even 
more basic consideration is how to measure error of prediction or 
reproducibility. It so happens that in a perfect scale of dichot- 
omous items all the fourfold tables will have a zero cell in the proper 
place so that all tetrachoric correlation coefficients are always equal 
to +1. In this sense, tetrachoric coefficients could be used to test 
the hypothesis that a perfect scale was present. If all the coeffi- 
cients are not +-1, then either the area is only approximately scal- 
able, or is not scalable at all. But to test for a perfect scale, it is 
even easier to forget about any correlation coefficients at all and 
just to look and see if the proper cells have exactly zero frequencies. 

The hypothesis of a perfect scale, it is safe to say, will almost in- 
evitably be rejected in practice. There will almost inevitably be 
various kinds of errors present in empirical data. The practical 
problem is not to test whether a perfect scale is present, but rather 
whether an approximately perfect scale is present. What kind of 
error does an imperfect tetrachoric correlation imply? 

Since the tetrachoric coefficient is equivalent to a product-moment 
coefficient when the underlying assumptions are fulfilled, this means 
that error is implicitly measured in terms of least squares. If the 
assumption of normality is fulfilled, then an imperfect tetrachoric 
coefficient implies that the squares of the errors of prediction of one 
quantitative variable from the other are greater than zero, and the 
coefficient deviates from perfection according as the squares of 
errors increase. Thus, the coefficient is concerned not with the 
predictability of one dichotomy from the other, but of one quanti- 
tative variable from another. The prediction of the actual dichot- 
omies is more properly done by counting the errors of prediction, 
which is quite different from the least-squares prediction of the 
quantitative variable. For example, consider the following four- 
fold table between two ‘‘Yes’”-“‘No”’ questions: 


Question 1 
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How would we actually predict question 2 from question 1 in prac- 
tice? For this we must use the qualitative technique of modal 
probability.“ If a person says “Yes” to question 1, we would pre- 
dict him to say ‘‘No’’ to question 2; we would be right 40 times out 
of 70. If a person says ‘‘No” to question 1, we would again predict 
him to say ‘“‘No”’ to question 2, being right 20 times out of 30. All 
told, we would be right 60 times out of 100. A similar process 
would be used to predict question 1 from question 2, with the total 
correct predictions being 70 out of 100. 

In scale analysis, the problem, however, is not how well the items 
can be predicted from each other, but how well they can be pre- 
dicted from the scale score. If the above two questions are from a 
scalable area, then they should ordinarily be highly reproducible 
from the scale scores, with at least 90 per cent reproducibility. 
There is only one nonscale type, that in the upper right-hand cor- 
ner; and this has 10 per cent of the people. Scoring this type with 
either the ‘“Yes-Yes’’ or ‘“‘No-No” type will yield one error for each 
of these people. Hence there are ten errors out of two hundred 
responses, or 95 per cent reproducibility of the two items from them- 
selves. 

Of course, this reproducibility may be spuriously high; it will not 
necessarily hold up when more questions are added. Reproduci- 
bility is not to be determined in practice from only two questions. 
We use this little example here only to highlight the point that there 
is no necessary relationship between an indirect least-squares treat- 
ment of hypothetical quantitative variables, and a direct test of the 
reproducibility of the qualitative items themselves, when error is 
present. Scale analysis treats qualitative data qua qualitative data, 
and measures error accordingly. Tetrachoric coefficients do not 
deal with the items themselves, but bring in unnecessary (and un- 
tested) hypotheses which have no clear relationship to the problem 
of reproducing the qualitative data. 

Biserial correlation coefficients. A correlation coefficient that 
comes closer than the tetrachoric to suiting the purpose of scale 
analysis is the biserial correlation coefficient. This coefficient can 
test the hypothesis that each item is perfectly reproducible from the 
scale score. In a perfect scale, the biserial coefficient of each item 
with the scale score is +1. But again, in the case of a perfect scale, 
there is no point to worrying about correlation coefficients at all. 


16 See Louis Guttman, «bid. 
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Just looking to see if the proper zero cells are present is all that is 
necessary. 

The biserial coefficient falls down on two of the same counts that 
does the tetrachoric: its assumptions and its treatment of error. 
The biserial coefficient is used to relate a dichotomy to a given (con- 
tinuous) quantitative variable. It assumes that the dichotomy is 
in reality but a slicing of another quantitative variable, and that the 
two quantitative variables—the given one and the hypothetical one 
—have a bivariate normal distribution. If these assumptions really 
hold, then the biserial coefficient is equal to the product-moment 
coefficient of correlation between the two quantitative variables. 

Thus, like the tetrachoric coefficient, hypotheses about under- 
lying quantitative variables and normal distributions are intro- 
duced. Since such hypotheses may or may not be true, they could 
be tested if the biserial correlations are to be used. But testing the 
hypotheses requires determining the hypothetical variables empiri- 
cally. So we are back to the problem of scale analysis, of testing 
the hypothesis that all the items are slices of the same quantitative 
variable. 

The biserial correlations will do this if the scale is perfect. Their 
interpretations become obscure if there is scale error, since they do 
not treat the items as qualitative data. There is no exact relation- 
ship in general between a biserial correlation and a coefficient of 
reproducibility. Treating the data on their own merits, as in scale 
analysis, is simpler and more direct, besides avoiding assumptions 
of normality and the like. 

What correlation coefficients should be factored? If one desires to 
proceed mechanically to factor a set of qualitative items by using 
techniques that were designed instead for quantitative variables, it 
is necessary first to compute correlations between the items. What 
correlation coefficients shall be used? Several research workers 
have proceeded by using tetrachoric correlation coefficients. This 
tacitly means that the items themselves are not being factored but, 
rather, the hypothetical quantitative variables are. In order to 
prove that this procedure is proper even for hypothetical variables, 
it is necessary to prove that the assumptions underlying the use of 
the tetrachoric coefficients are justified. It is possible to disprove 
this assumption in some cases by proving that the matrix of tetra- 
choric intercorrelations is not Gramian. It is not possible to do the 
converse, however, to prove that the assumptions are correct by 
showing that the matrix is Gramian. The matrix can be Gramian 
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and the assumption still be false. The only way really to prove the 
assumption is to exhibit empirically the underlying quantitative 
variable for each of the items. If the research worker has not done 
this, then his use of tetrachoric correlation coefficients must be held 
in doubt. 

If the assumptions underlying the tetrachoric correlation coeffi- 
cients are erroneous, then of course a Spearman-Thurstone factor 
analysis will be misleading even with respect to hypothetical quan- 
titative variables. From a scale analysis, it can be predicted quite 
well what the factor analysis will show. If the items themselves 
have high reproducibility, then a factor analysis of tetrachorics will 
show, not one common factor, but several common factors. The 
tetrachorics will ordinarily not follow the single factor pattern be- 
cause they are not affected uniformly by scale error. Scale error 
in a dichotomy with a 90-10 split will ordinarily reflect differently, 
in a tetrachoric, from scale error in a 50-50 split. Items with similar 
marginals will tend to have higher intercorrelations than items with 
dissimilar marginals. Hence, if the sample of items happens to have 
several items in one range of marginals, and several items in another 
range, it might be expected that scale error will cause the tetra- 
chorics to reveal two common factors algebraically in the Spear- 
man-Thurstone sense. The tetrachorics will reflect not just the 
single scale factor, but all sorts of artifacts, since such coefficients 
are quite sensitive to scale error and to extraneous considerations like 
marginal frequencies. 

Other research workers have avoided the use of tetrachoric co- 
efficients and instead have used point correlation coefficients as a 
basis for a factor analysis. The point correlation coefficient is a 
product-moment coefficient so that the matrix of zero-order point 
correlation coefficients is always Gramian. It still does not follow, 
however, that a factor analysis of point intercorrelations will prop- 
erly describe the multivariate distribution of the attributes. In- 
deed, we shall now show that if the items form a perfect scale, so 
that each item is a simple function or cut of the same quantitative 
variable, then the point intercorrelations will be completely mis- 
leading. If there is but a single factor in the sense of scale analysis 
—that is, if each of the items can be reproduced from but a single 
quantitative variable—then a Spearman-Thurstone factor analysis 
of point correlations will never show this to be the case, but will in- 
stead exhibit apparently many factors. For convenience, let us 
consider four dichotomous questions. This time we consider the 
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four questions to form a scale so that there are five perfect scale 
types as shown in the following table: 


ion 
Type of person 1 2 8 4 Frequency 
I Yes Yes Yes Yes 10 
Il Yes Yes Yes No 20 
HI Yes Yes No No 40 
IV Yes No No No 20 
Vv No No No No 10 


The multivariate distribution of the questions as shown in the table 
follows that of a scale pattern. The fourfold tables for the zero- 
order correlations can be determined as indicated in the previous 
example. The point correlation coefficients computed from the 
fourfold tables are as follows: 


POINT INTERCORRELATIONS OF FOUR SCALE DICHOTOMIES 


Question Question 
1 2 8 4 
1 — 51 .22 11 
2 51 — 43 .22 
3 .22 A3 — 51 
4 1 22 51 _ 


In order for there to be a single common factor in the sense of 
Spearman, the tetrads must vanish. This vanishing occurs for 
only one kind of tetrad in the above table, but not for another. A 
vanishing tetrad is in the upper right-hand corner, consisting of 
the elements in rows 1 and 2, and columns 3 and 4: (.22)(.22) — 
(.43)(.11) = 0. A nonvanishing tetrad consists of the four elements 
in rows 2 and 3 and columns 1 and 4: (.51)(.51) — (.22)(.22) = .21. 
In general, the following is true for the matrix of point intercorre- 
lations of scalable dichotomies, where the questions are arranged in 
order of their marginals: all tetrads involving elements on only one 
side of the diagonal will vanish, but all tetrads involving elements 
from opposite sides of the diagonal will not vanish. Since there 
must be nonvanishing tetrads in general, Spearman’s criterion is not 
satisfied by a scale. 

This can be shown for the general case as follows. From the fact 
that there is a zero cell in each fourfold table from a scale, the for- 
mula for the point correlation simplifies as follows. If p; is the 
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proportion of people who give a ‘favorable’ answer to the jth item 
and p, is the proportion who give a ‘“‘favorable’”’ answer to the kth 
item, and if p,is less than p;, then the point correlation between the 
two items is 


Tih = (2) (+=) Dj S Pe 


If we let b; = i Pi and bh = i Pe , then the formula for the point 
Laie Lee 


J 
correlation becomes 


tik = i b Sh 


It will be recalled that in Spearman’s single factor theory, if aj and 
a, are the respective correlations of the jth and kth quantitative 
variables with the common factor, then the correlation between the 
two variables is 


Tyk = A;Qy G ¥ k) 


Comparing these two formulas for the correlation coefficient, it is 
clear that they are contradictory. If there is a single factor in the 
sense of scale analysis for qualitative data, there cannot in general 
be a single common factor in the Spearman sense, so that more than 
one apparent common factor will emerge from a Spearman-Thur- 
stone analysis. 

The components in a scale. If a set of dichotomies forms a scale, 
it is clear then that the Spearman-Thurstone factor analysis will not 
detect this fact. On the other hand, a scale analysis can tell us 
what to expect from a factor analysis and shows us in what respects 
the factor analysis will be misleading. The equations of scale anal- 
ysis as analyzed in Chapter 9 show further why a factor analysis 
using point correlations apparently discovers more than one factor, 
whereas in fact there is but a single variable present in the sense of 
scale analysis. These equations show that a scale of dichotomies 
can be resolved into many components. Each item can be regarded 
as having two quantitative values, zero and unity, which can be ex- 
pressed as a linear function of these components. In a certain sense, 
the first component is essentially the scale scores; the second com- 
ponent has been identified as the intensity function; the third and 
remaining functions have not yet been named, but it is known that 
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they are of an oscillatory character. The important feature of these 
functions is that they are all perfectly related, albeit in a curvilinear 
fashion, with the scale scores. The linear correlations of these func- 
tions with each other, including their correlations with the scale 
score, are all zero, yet they all are perfect curvilinear functions of 
the scale score. It is this paradox of having perfect curvilinear 
functions which are, nevertheless, linearly uncorrelated that throws 
factor analysis off. The Spearman-Thurstone approach to factor 
analysis is a completely linear one and is not adequate for analyzing 
the curvilinearities which are inherent in the scale pattern. The 
fact that a scale has a single factor but nevertheless many com- 
ponents which are perfect functions of that factor makes a Spear- 
man-Thurstone factor analysis inappropriate here. 

One of the interesting features of scale analysis is that, while it 
has the implications of these oscillatory components, nevertheless 
there is no need to worry about them in testing the hypothesis of 
scalability. The innocent-looking scale pattern is all that has to be 
analyzed by any of the several simple scale analysis techniques now 
available. In doing this, the mathematics of scale analysis are im- 
plicitly employed, but there is no need for the clerk to know any- 
thing about them. Factor analysis with its heavy computations is 
a vastly more tedious and expensive kind of procedure. Its com- 
plexity, however, does not mean therefore that it is appropriate for 
qualitative data. Factor analysis may be a proper approach for 
some problems involving quantitative variables. As for qualita- 
tive data, it is theoretically more desirable and in practice far simpler 
to analyze qualitative data qua qualitative data. 

Communalities and reproducibility. It may be worth pointing out 
some further differences between factor analysis and scale analysis 
with respect to how they intend to study the structure of their re- 
spective kinds of variables. Scale analysis tests a single factor 
hypothesis for qualitative data: can a single quantitative variable 
suffice to reproduce the response of each individual to each ques- 
tion? The efficiency of this reproducibility is measured by the 
simple per cent of correct estimates of persons’ responses from their 
scale scores. 

In the Spearman-Thurstone approach to factor analysis, even in 
the single factor theory it is not attempted to reproduce each 
person’s score on each quantitative variable from his score on 
the common factor. Actually, there is no single factor theory in 
the Spearman-Thurstone approach because of the distinction made 
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between common factors and unique factors. Each variable in a 
set is thought to be composed of not just the common factors, but 
also of a unique factor (which may be a composite of several specific 
and error factors). In order to reproduce a person’s score, then, on 
one of the original variables, one would have to know not only his 
common factor score but also his unique factor score. Since each 
variable has a unique factor, in general there are n + 1 or more 
factors implicit in a factor analysis of n variables, and not just one 
factor. The common factor does not account for the whole vari- 
ance of each variable, but only for part of the variance, which is 
called the communality. The purpose of the analysis is to find out 
if common factors can be found which will reproduce the correlation 
coefficients between the variables. The number of common factors 
does not depend upon the size of the communalities to any appreci- 
able extent. The communalities can be very low, yet one common 
factor may be found or many common factors may be found; or the 
communalities may be high, yet either one common factor may be 
found or many common factors. The correlations can be repro- 
duced perfectly even though the individual scores can be reproduced 
only very inaccurately from the common factors. Thus, a single 
common factor in the sense of Spearman and Thurstone for quanti- 
tative variables does not imply the same kind of reproducibility for 
its variables as the single common factor of scale analysis implies 
for its qualitative items. 

As a final comment, it might be pointed out that in the mental 
test field, where factor analysis originated and has been most widely 
used, the quantitative variables employed are themselves scores ob- 
tained from sets of qualitative items. An achievement test is a 
sample from a universe of qualitative data. It might be suggested, 
therefore, that before a factor analysis is applied to scores derived 
from sets of items, each test separately be first subjected to a scale 
analysis. If the test is not a scale (or quasi scale), then it should 
not be represented by only a single score in the factor analysis but 
perhaps should be broken down into subareas, each of which might 
prove to be scales or quasi scales. If tests used in a factor analysis 
are neither scales nor quasi scales, then it is hard to see how a factor 
analysis can show any clear-cut results. Perhaps one reason for 
low communalities is the fact that the tests used are not scales or 
quasi scales, so that the scores on them do not have much internal 
meaning. 

Generalized factor analysis: latent structure theory. The develop- 
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ment of the theory of latent structure for qualitative data by 
Lazarsfeld, which was also an outgrowth of the work of the Research 
Branch, enables us now to view factor analysis in a broader setting. 

The generalized theory of common factors—or of latency—as 
developed by Lazarsfeld, can be stated as follows. Consider a set of 
n manifest variables 271, v2, --- , Z» Whose structure is to be analyzed 
for a given population. These variables may be all qualitative, 
partly qualitative and partly quantitative, or all quantitative. 
That is, we are not at all limiting the forms of the variables. Some 
may be dichotomies, some trichotomies, some may have ten cate- 
gories with no particular order to them, some may have many or- 
dered categories, some may be discrete quantitative variables, some 
may be continuous quantitative variables. 

The basic hypothesis of the generalized theory is that the popu- 
lation can be divided into a number of subpopulations, such that 
within each subpopulation the n manifest variables are statistically 
independent. Each subpopulation can be called a latent category. 
The set of latent categories can be called the latent set. The latent 
set may be an attribute—that is, consist of nonnumerical and even 
nonordered categories; if in addition the manifest variables are also 
attributes, then we have Lazarsfeld’s theory of latent structure. 
The latent set may be a single quantitative variable; if in addition 
the manifest variables are quantitative (and if zero correlations also 
imply complete statistical independence), then we have Spearman’s 
case of a single common factor. The latent set may comprise more 
than a single quantitative variable—that is, be a vector variable; 
if in addition the manifest variables are quantitative (and if zero 
correlations also imply complete statistical independence), then we 
have Thurstone’s case of multiple common factors. 

All this can be stated analytically in terms of the frequency dis- 
tributions involved. Denote the latent set by \. If there are m 
subpopulations involved all told, then \ has m categories or values. 
The multiple frequency function of the n manifest variables for a 
given value of \ can be denoted by the usual notation for conditional 
frequency functions: f(z, 22, ---, t.|). If none of the mani- 
fest variables involved is continuous, then each value of f is an actual 
probability—such as in Lazarsfeld’s case. Otherwise, f represents 
probability density as is usual for continuous variates. 

Similarly, we can denote the distribution of each manifest variable 
separately for a given value of \ by the notation of conditional dis- 
tributions. The n conditional frequency functions are: f,(x | A), 
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fa(ae |), ---,fn(@n| A). Again, anf; denotes either an actual prob- 
ability or a probability density, according as the distribution is 
discrete or continuous. 

The basic hypothesis of statistical independence within subpopu- 
lations is then equivalent to the statement that the multiple con- 
ditional distribution is equal to the product of all the first order 
conditional distributions :” 


(1) f(a, Ta, °°° »%n| A) = fi(a1 | A) fala | d) ++ fa(tn | A) 


Equation 1 contains both the Spearman-Thurstone structures and 
the Lazarsfeld structures as well as possible mixtures of the two. 
As an algebraically degenerate case, it also contains the perfect 
scale. It is the basis of what Lazarsfeld calls ‘accounting equa- 
tions.”’ 

For the perfect scale, the latent set \ is simply the scale ranks or 
scores. The conditional frequency distributions are all degenerate 
in that they have only two probability values possible: unity and 
zero. All people with the same rank have the same response on each 
manifest variable; they are all in but one category of each z;, for 
which f; is then unity, and the remaining categories have their f,’s 
vanish. Furthermore, the nonvanishing f; form the parallelogram 
pattern of the scalogram. The scale types are defined in the left 
member of equation 1 by those values of the manifest variables for 
which f is unity. The nonscale types do not occur; for them f is 
zero. 

As Lazarsfeld has shown, the quasi-scale pattern may also be 
viewed as a special case of equation 1. The set A is the ranks or 
scores on the quasi scale. This time, however, the conditional dis- 
tributions are not degenerate; they have probabilities between zero 
and unity. Furthermore, the categories of the manifest attributes 
can be arranged in such an order that the conditional probabilities 
form a gradient within each manifest attribute. Thus, a person in 
a higher category of \ has a higher probability of being in a higher 
category of xz; than a person in a lower category of \, and this is true 
for all 2;. 

The perfect scale requires the discrete probabilities of zero and 
unity; it therefore implies that the manifest variables are discrete. 
Indeed, the practical work to date has been with attributes which 
are discrete manifestations. On the other hand, although quasi 


1 This is equation 1.1 in Chapter 10. 
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scales are also discussed in these chapters from the point of view of 
qualitative data, the concept holds even for continuous quantitative 
variables, including Spearman’s single common factor. 

If each of the manifest z; and also the latent \ are continuous 
quantitative variables, and if they have a normal multivariate fre- 
quency distribution, then the condition of independence of equa- 
tion 1 is equivalent to Spearman’s relationships between correlation 
coefficients: 


(2) Trnx = 0 (j # k) 


Equation 2 states that the correlation between any two manifest 
variables is zero within each subpopulation. This leads to Spear- 
man’s tetrad equations and the related algebra. Thus, for this 
quantitative case, an algebra of correlation coefficients suffices to 
study equation 1. 

That Spearman’s single common factor pattern is also a quasi 
scale can be seen from the regressions of the manifest variables on 
the latent variable or factor. The regressions are linear, so that a 
person with a higher value of ) is more likely to have a higher score 
on 2; than a person with a lower value of \, and this is true for all 
the manifest z;. (The case of negative regression coefficients can 
always be taken care of by rearranging the appropriate manifest 
values, which in this case means simply reversing the signs of the z; 
that require it.) 

Generalized factor analysis and latent structures. In Chapters 10 
and 11 below, Lazarsfeld develops the theory of latent structure 
where both the manifest variables and the latent set are qualitative. 
The structure of the latent dichotomy is the case where ) has only 
two categories, and more generally \ may have m categories with 
no particular order among themselves. The subpopulations may 
differ in kind rather than in degree as in the case of the perfect scale 
or quasi scale. No assumptions of rank order of any kind are re- 
quired for the population. 

For Lazarsfeld’s qualitative structures, the probabilities are all 
discrete. Each member of equation 1 consists of actual probabili- 
ties. Chapters 10 and 11 show how to study these probabilities by 
means of an algebra of matrices of frequency distributions. These 
matrices do not exist when the manifest variables are continuous, 
for then we have probability densities or ordinates, rather than 
actual discrete probabilities, in equation 1. An appropriate algebra 
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for the continuous case—if the multivariate distribution is normal 
—is that of Thurstone’s multiple common factors. 

We have already seen how Spearman’s structure is a quasi scale 
for continuous variates. If Spearman’s hypothesis is not satisfied, 
then the categories of \ do not form a single quantitative variable. 
Thurstone’s techniques are concerned with equation 2 where ) is a 
set of quantitative variables. This leads to a study of the matrix 
of correlation coefficients—not to be confused with Lazarsfeld’s 
matrices of probabilities—which is equivalent to a study of equa- 
tion 1 when the multivariate distribution is normal (so that zero 
correlations imply statistical independence). 

Thus we see again how the algebra required for qualitative data 
differs from that for quantitative (especially continuous) variables. 

On the other hand, from equation 1 we can see that it is possible 
always to apply the qualitative techniques to quantitative data! 
It is always possible to divide quantitative manifest variables into 
class intervals, even into dichotomies, without disturbing the struc- 
ture of equation 1. Or, for qualitative manifest data, it is possible 
to combine categories in any way one pleases without disturbing 
the form of equation 1. This theorem follows immediately by ob- 
serving that summation (for discrete categories) or integration (for 
continuous categories) over values of an z; on the right of equation 
1 merely replaces the f,’s of those values by the conditional prob- 
ability of the combined categories or of the interval of integration. 
The multiplication of probabilities on the right of equation 1 is 
undisturbed—so that exactly the same latent set holds for the combined 
data as for the original data. 

It is interesting that the property of invariance of structure under 
combination of categories holds not only for scales and quasi scales 
but for all latent and common factor structures. 

Dividing continuous manifest variables into intervals yields dis- 
crete probabilities, and these can be studied by Lazarsfeld’s algebra. 
Of course, no different latent categories can emerge that were not 
present in the original data. On the other hand, latent categories 
can be lost (through implicit combination) by the combination of 
manifest data. In the case of scales, we know that scale types are 
lost by combining categories. In the case of quantitative variables, 
it may be that whole common factors can be lost. In any event, it 
may well be worth exploring to what extent it is worth while to use 
qualitative techniques on quantitative data. At present, there is 
no method for nonnormally distributed quantitative variables; the 
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Spearman-Thurstone “factors” do not actually satisfy equation 1 
unless zero correlations imply full statistical independence. The 
nonmetric technique of, say, dichotomizing the variables and then 
using Lazarsfeld’s algebra does provide a method for such a case. 
Since common factors may be lost by dichotomization, it may be 
that a series of analyses, based on different kinds of dichotomiza- 
tions of the manifest variables, could tease out all the latent 
categories—if a latent structure holds at all. 

Justification for the combination of categories. One of the inter- 
esting and important consequences of equation 1 is the justification 
which it provides for a practical procedure used in scalogram 
analysis. If manifest items have three or more categories, it has 
very often been found necessary in practice to combine some of these 
categories to reduce scale error. The theorem of the previous sec- 
tion shows what the effect of such an operation is if the uncombined 
data have a latent structure. The scale defined by the combined 
data must be part of the original latent structure; no latent types 
are lost. Each scale type that emerges is also a latent category. 

What is lost by combining manifest categories is some possible 
latent categories that reflect factors additional to the scale variable. 
Such factors can be nonessential to the study and hence ignored, 
such as a factor of verbal habit described before. 

On the other hand, all that may be lost is but a difference between 
two quasi-scale types when two adjacent categories are combined 
and then found to fit a perfect scale. The categories may each sep~ 
arately be found to satisfy equation 1, but to have scale error, 
whereas their combination (which then must also satisfy the equa- 
tion by our theorem) will have no scale error. This may imply a 
“quasi-scalish” relationship between particular categories, though 
the perfect scale pattern holds for the data as a whole. 

Other possible structure theories. We now wish to point out three 
things which are not necessarily to be inferred from the foregoing 
discussion. First, the case of the perfect scale is not necessarily to 
be regarded as only a part of the theory of latent structure. Second, 
latent structure is not to be regarded as the only kind of structure 
theory possible for a universe of content. And third, the quasi-scale 
pattern as seen from the point of view of latent structure may not 
be the only type of quasi scale. 

That the perfect scale satisfies equation 1 is not surprising. In 
fact this is mathematically trivial. This triviality is not just a conse- 
quence of the degenerate probabilities involved, but of an obvious 
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corollary of equation 1. It is obvious that, if the set of manifest 
types is used for A, then equation 1 is satisfied no matter what the 
original frequency distribution is. That is, equation 1 is always 
satisfied mathematically if the manifest set is used for the latent 
set. 

In the perfect scale, there is a one-to-one correspondence between 
scale types and ranks, so that using ranks for \ is equivalent to 
using the manifest types. This is trivial as far as equation 1 is con- 
cerned. In the same way, for example, nonscalable data might 
have any values assigned each type, and these values used for A; 
this will always satisfy equation 1, no matter what the structure of 
the data. Or again, for quantitative variables, any nonsingular 
linear transformation of the manifest variables can be used for 
(including the original z; themselves), and equation 1 will always be 
formally satisfied no matter what the manifest variables are. 

Therefore, the importance of the scale pattern does not lie in that 
it satisfies equation 1. Its importance and its properties lie in the 
internal pattern of the scale itself, which has nothing whatsoever to do 
with satisfying equation 1. Equation 1 says nothing about rank- 
ing individuals or similar problems. It merely gives a definition of 
latency, and of latent categories. It so happens that for the per- 
fect scale, the latent categories are always manifest, so that the no- 
tion of latency adds nothing at all; it actually contributes nothing 
to the mathematics of scale theory (as, for example, developed in 
Chapter 9). The theory of order of scale analysis is thus quite inde- 
pendent of latent structure theory. The relationship between the 
two is mathematically trivial. Indeed, scale theory bas a mathe- 
matical relationship to another theory of structure that is radically 
different from that of equation 1. 

On the other hand, the fact that quasi scales can be viewed from 
the perspective of equation 1 is far from trivial. Here, the latent 
types and the manifest types do not coincide. Indeed, latent struc- 
ture theory is important only for those cases where the number of 
latent types is less than the number of manifest types. This condi- 
tion is satisfied by quasi scales, whereas it is not by perfect scales. 
It is true that the problem of order within quasi scales is not touched 
on by equation 1, but requires additional considerations; however, 
equation 1 introduces the latent categories upon which to base the 
additional considerations. 

There is a different theory of structure from that of Spearman- 
Thurstone-Lazarsfeld for which the perfect scale is a special case, 
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and nontrivially so. This theory does not require any notion of 
latency or common factors, but is entirely in terms of manifest vari- 
ables. It may be called a theory of conditional order. Its basic 
hypothesis is that the manifest variables can be arranged in such an 
order that, if one variable is held constant, then the immediately 
preceding and succeeding ones are statistically independent. This 
hypothesis is expressed by the formula: 


(3) S (as, Zu | tj) = fig(@s | 25) fuj(we | 2,) <j <k) 


It is not intended to enlarge on this theory here but only to point 
out that the scale parallelogram satisfies equation 3 nontrivially 
(albeit degenerately). Furthermore, if the z; are all quantitative 
and satisfy equation 3, then their correlation matrix can be proved 
to have principal components that satisfy the difference equations 
for scale weights of Chapter 9. Thus, the weight equations of 
Chapter 9 apply also to a general case of equation 3. The scale of 
dichotomies is the special case where the score equations also have 
the oscillatory properties proved in Chapter 9. 

The structural equation 3 in general is mutually exclusive of equa- 
tion 1, and it has a general mathematics of order that applies in par- 
ticular to the perfect scale. 

It remains to be seen whether other general structural equations 
besides those of equations 1 and 3 can be found that will also lead to 
alternative useful theories of quasi scales. There is still a great 
amount of work to be done on the general theory of gradient in 
responses. Latent structure theory deals with the gradient in 
terms of a gradient within latent categories. But there may still 
be an obvious gradient in manifest responses and yet the hypothesis 
of latent distance need not be satisfied. That is, subpopulations 
may be defined and ordered so that a manifest gradient exists, and 
yet statistical independence need not hold within these subpopula- 
tions. This is a problem that needs further research. 


CHAPTER 7 


THE INTENSITY COMPONENT IN 
ATTITUDE AND OPINION RESEARCH’ 


CALE analysis provides an answer to the problem of how to rank 
S people from high to low with respect to the particular area be- 
ing studied. From the rank order of a person it is possible to repro- 
duce his answers to all the questions. In the case of dichotomies, 
it is possible to predict that a person who endorses a more extreme 
statement must endorse all less extreme statements. More gen- 
erally, persons who endorse one statement must all have higher ranks 
than persons who do not endorse the same statement. Scale anal- 
ysis thus provides 4 rigorous definition of a unidimensional scale 
continuum. 

Given such a rank order, we can say that A is more favorable than 
B who is more favorable than C and so forth. But what can we say 
about A, B, and C with regard to whether or not they have a “favor- 
able” or an “unfavorable” attitude? One person can be more favor- 
able than another, yet both may in some sense be “unfavorable.” 
Can we fix any cutting point along the scale continuum which would 
permit the division of the population into “favorable” and ‘un- 
favorable,” rather than only a relative ranking in terms of degrees 
of favorableness? Are the two ends of a scale continuum opposite 
in any sense? 

This problem of the meaning of the different extremes of a scale 
continuum is clearly recognized by McNemar, who states, ‘‘Another 
aspect of the single continuum problem is the meaning of the two 
extremes of a scale. If one end indicates a favorable attitude, does 
the other end represent unfavorableness or something else? For 
instance, is Fascism the opposite of Communism? . . . Are we deal- 
ing here with one or two dimensions? To interpret the extremes as 
being real opposites or to interpret the extremes as on different con- 


1 By Edward A. Suchman. 
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tinua is precarious in the absence of knowledge concerning the real 
meaning of the extremes.” ? 


The Problem of a Cutteng Point 


The need for an objective definition of a cutting point is especially 
vital for public opinion polls. Public opinion analysts and other 
workers in the field of attitude measurement have long been aware 
of the problem of question ‘‘bias.’”” There are numerous cases on 
record in which differently worded questions dealing with the same 
issue have produced different percentages of the population ap- 
parently “‘opposed to” or “in favor of” the issue being studied. 
Slight changes in the question wording, in the order of presentation 
of the answer categories, in the position of the question in the ques- 
tionnaire, and many other factors known to all pollsters may affect 
the findings of one’s poll. As a result of this variation in polling 
results, a great deal of attention has been given to the determination 
of criteria to be used in constructing an “unbiased” question. For 
the most part the criteria suggested in the past have been subjective, 
being based upon ‘expert opinion.” 

How can we account for this variation in poll results depending 
upon the question asked? As was discussed at length in the previ- 
ous chapters, the answer we propose involves the concept of an at- 
titude or opinion universe. Any single question asked on an issue is 
but one sample of all possible questions that could be asked on that 
same issue, and the proportion saying ‘“‘Agree’’ or ‘‘Yes” to these 
questions can theoretically range from 0 to 100 per cent.’ For ex- 
ample, a survey of soldiers’ attitudes toward the British asked, ‘‘Do 
you agree or disagree with the following statements about the 
British?” 


The British are doing as good a job as possible of fighting the 
war, everything considered. 


—_—__— The British always try to get other countries to do their fighting 
for them. 


We find 80 per cent of the soldiers agreeing to the first statement, but 
only 48 per cent disagreeing to the second statement. Which of 


3Quinn McNemar, “Opinion-Attitude Methodology,” Psychological Bulletin, Vol. 
43, No. 4 (July 1946), p. 299. 

* Compare the “tree” of public opinion in Hadley Cantril (editor), Gauging Public 
Opinion (Princeton University Press, Princeton, N.J., 1944), pp. 26-27. ‘‘Interven- 
tionist” sentiment was found to vary from 8 per cent to 78 per cent depending upon the 
particular question asked. 
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these questions is an “unbiased” question with respect to the entire 
issue? How many soldiers are ‘‘favorable” toward the British? 

It should be clear, then, that it is difficult to conceive of a mar- 
ginal frequency distribution which can adequately represent the 
whole universe when obtained from but a single question. There 
are innumerable nuances that can be put into questions by different 
shades of wording, different check lists, different ordering of ques- 
tions in the questionnaire, different atmospheres of answering a 
questionnaire, and the like. Each question used in a survey is but 
a sample of all the variations of it that could have been used instead. 

From these variations in polling results depending upon question 
wording, and from lack of “expert” agreement as to what is ‘‘biased”’ 
or “unbiased,” it appears quite clear that what is greatly needed is 
some objective method of dividing the respondents into the same 
proportions pro and con regardless of question wording. With such 
a method different opinion pollsters would come out with the same 
results, even though they should ask different questions concerning the 
same topic. These results would be independent of many of the 
various “biases” to which public opinion polling today is subject. 

Is such a method possible? Can there be an objective method for 
determining an invariant dichotomization of the population with re- 
gard to an opinion or attitude area? 


The Intensity Function 


An answer to the above question must involve the determination 
of a cutting point that is “meaningful” with respect to a whole scale 
continuum. A suggested method for doing this is proposed in the 
following sections. This method utilizes what we call the intensity 
function. It is based upon the concept of a scalable area in which 
individuals are ranked from high to low and for which it is possible 
to measure the intensity of feeling with which people at different 
scale positions hold their attitudes or opinions. Intensity of feeling 
is conceived to be strongest at both ends of the scale continuum and 
to decrease as one moves toward the middle. The scale position 
with lowest intensity serves to mark off a point of ‘neutrality’ 
which provides a division of the population into two groups, one 
positive and the other negative. Furthermore, as a consequence of 
the scalability of the area, this division does not depend upon the 
particular sample of questions used. The cutting point is deter- 
mined objectively and is invariant with respect to wording of ques- 
tions. 
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In the following sections we will first discuss in detail the theo- 
retical foundations for the intensity function. Then we will report 
upon various techniques used in practice to measure the intensity 
function and show several examples of the actual determination of 
cutting points by means of the intensity function. Finally, we will 
present two examples of the independence of the intensity function 
and the cutting point from the “‘biases’”’ involved in any specific 
questions asked. 


The Basis of the Intensity Function 


We conceive of a content scale which ranks people from high to 
low on a single content continuum. We furthermore conceive of 
an intensity scale which ranks people from strong to weak on a 
single intensity continuum. Correlation of these two scales—con- 
tent and intensity—produces a J- or U-shaped curve. As one 
moves from one end of the content scale, intensity of feeling de- 
creases until a point is reached at which intensity of feeling begins 
to increase again. This point represents the content position of 
persons with lowest intensity. It may be thought of as a point of 
indifference or neutrality.‘ 

We shall call this point the zero point. This zero point appears 
to offer us an invariant cutting point on a scale running from favor- 
able to unfavorable which permits us to divide the population into 
two groups which can be meaningfully labeled as “positive” and 
“negative.” Since the rank order of people on both content and 
intensity is independent of any particular sample of questions asked, 
the shape of the U curve and the location of the lowest point on this 
curve also remains independent of any particular sample of ques- 
tions asked. The intensity curve is fixed for any single attitude 
universe—any sample of questions from that attitude universe will 
produce the same results. 


The Ideal Curve 


An example of an ideal intensity curve with a perfect correlation 
between content scores and intensity scores is given in Table 1a, 
assuming a sample of content consisting of ten equally frequent 


4 The results of several previous studies on intensity of feeling seem to bear out this 
rational explanation for the existence of a U-shaped intensity curve. For example, 
Kats shows that the more “extreme” an attitude is, the more intensely it is likely to be 
held. D. Kats, “The Measurement of Intensity,” in Cantril, ibid., Chapter 3. This 
chapter deals with several different methods of measuring intensity of feeling in opinion 
surveys. 
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ranks and a sample of intensity consisting of six ranks. Along the 
horizontal axis is plotted the percentile order of respondents accord- 
ing to their content scale scores, while along the vertical axis is 
plotted the percentile order of individuals according to their in- 
tensity scores. Theoretically, the intensity scores must have a per- 
fect curvilinear relation with the content scale scores. All persons 
in any one rank or content must have the same rank on intensity. 


TABLE 1a 
Punrect CoRRELATION TaBLE OF ConTznt Soorgs BY INTENSITY 
Scorzs 
Content rank 
Intensity Total Cum 
rank (Neg.) Pos.) — freq. freq. 
12 8 § 6 6 7 8 § 
5 (High) 10 10 100 
4 10 10 20 90 
3 10 10 20 70 
2 10 10 20 50 
1 10 10 20 30 
0 (Lew) 10 10 10 
Total 
freq. 10 10 10 10 10 10 10 10 10 10 100 
Cum. 
freq. 10 20 30 40 50 60 70 80 90 100 


At least one of the extreme content ranks must correspond to the 
highest intensity rank, while the arms of the J or U must come 
down to the lowest intensity rank. 

In the intensity curve there are usually two content ranks for 
each intensity rank, except for the lowest intensity rank. The con- 
tent percentile of the persons with the lowest intensity rank there- 
fore corresponds to the zero point. Were it not for the presence of 
errors in the empirical measurement of the intensity function—a 
problem which will be discussed later—it would be possible to deter- 
mine the zero point along an attitude continuum simply by deter- 
mining the content rank for persons with the lowest intensity scores. 
This means locating a point along the content continuum which con- 
tains the persons who are most “‘indifferent’’ or “‘neutral.”’ 

The zero point thus established has a definite meaning with re- 
spect to the attitude universe. It separates those whose intensity 
of feeling goes up as their attitude is more favorable, from those 
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whose intensity of feeling goes up as their attitude is less favorable. 
A further important property is the invariance of the zero point. 
The intensity curve for one sample of items is essentially the same 
as the intensity curve for any other sample of items, and is, in fact, 
essentially the intensity curve for the entire scalable universe. The 
basic importance of the concept of testing a sample of items to 
determine if they belong to a scalable universe has been discussed in 
the previous chapters on scale analysis. It is because two different 
samples belong to the same scalable universe that they produce the 
same rank orders of individuals on both content and intensity, and 
therefore result in the same intensity curve and zero point. Two 
investigators would have to come out with the same percentages of 
the population on either side of the zero point regardless of the 
questions asked, provided only that they dealt with the same scal- 
able issue. An empirical demonstration of this property will be 
given later. 


The Problem of Error 


The research problem is to measure intensity empirically. If this 
can be done, the zero point can be determined by plotting intensity 
of feeling against degree of favorableness. The ideal curve is prob- 
ably impossible of attainment in practice. Present techniques for 
the measurement of intensity all contain so much error that in all 
cases intensity has been measured as a “‘quasi scale.” 5 One essen- 
tial difference between a scale and a quasi scale is that from scale 
scores one can reproduce the response of each individual to each 
item (within the margin of error provided by the coefficient of 
reproducibility of the scale), while in a quasi scale this internal 
reproducibility is not possible. However, the rank order of respond- 
ents on a sample of items from a quasi-scale area would still be essen- 
tially the same rank order as that in the universe of all items that 
could have been used. Therefore, the ordering of the persons on 
content and on intensity is essentially invariant with respect to the 
particular sample of questions used. 

The fact that intensity forms a quasi scale shows that there is 
one dominant factor in the sample of intensity questions asked, 
which we call intensity of feeling, and many other smaller factors. 
It is the presence of this dominant factor which permits the appear- 


* For a more complete discussion of a ‘quasi scale’ see Chapters 5 and 6. 
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ance of the desired U curve. Because of the error in our present 
techniques of measuring intensity, it is necessary to use some aver- 
age measurement of the intensity scores. The regression medians 
of intensity on content do yield a J- or U-shaped regression curve 
and seem to afford an adequate working picture of the intensity 
function. It is necessary to use the entire population sample for 
the determination of the intensity component. Proportions of the 
population at any given rank or on one side of the zero point are 
subject to ordinary sampling error. 

It is also advisable in an intensity analysis to have as many scale 
ranks as possible for both content and intensity scales. The more 
points one has to plot, the more detailed will the intensity curve 
become. In particular, the close determination of a zero point re- 
quires questions whose cutting points are in the region of the bend- 
ing of the curve, as contrasted to the determination of rank order 
alone which requires only the number of questions needed for the 
desired number of rank orders. 

It is to be expected that as further research develops better tech- 
niques for the measurement of intensity of feeling, the obtained in- 
tensity curve will more closely approximate the ideal curve. 


A Technique for Observing Intensity 


A simple approximation of the intensity function has been suc- 
cessfully attained by asking a question about intensity of feeling 
after each content question. One form used for an intensity ques- 
tion is simply: ‘‘How strongly do you feel about this?” with answer 
categories of “‘Very strongly,” “Fairly strongly,” and ‘Not so 
strongly.”’ Repeating such a question after each content question 
yields a series of intensity answers. Using the same procedure as 
was outlined previously for the content answers, these are scored 
and each respondent is given an intensity score. The intensity 
scores are then cross tabulated with the content scores. 

This method of measuring intensity of feeling corresponds to the 
technique of securing a judgment of a ‘“‘whole”’ by asking for judg- 
ments of different ‘‘parts’’ of the “whole.” ® It is often very diffi- 
cult to determine judgments of a complex object, but breaking that 
object down into several simpler components which can be judged 


‘This approach is developed in greater detail in P. Lazarsfeld and W. Robinson, 
“The Quantification of Case Studies,” Journal of Applied Psychology, Vol. 24, No. 6 
(December 1940), pp. 817-825. 
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may permit one to determine a judgment of the complex object by 
combining the judgments of its component parts. Thus, for ex- 
ample, one could determine people’s judgments of a horse not by 
asking the person whether he liked the horse as a whole, but by 
asking for simple ‘‘yes’’-“‘no’’ judgments about different parts of the 
horse, such as his head, his neck, his feet and his tail, and then com- 
bining these separate judgments. In a similar way, while it might 
be difficult to determine how strongly people felt about some broad 
area, such as their attitude toward their officers, asking them how 
strongly they felt about various specific aspects of their officers’ 
behavior permits one to combine these “‘smaller’”’ judgments into a 
general judgment of intensity of feeling on the area as a whole. 

The cross tabulation of intensity scores by content scores should 
produce, in an ideal case, a table similar to Table 1a in the previous 
section. However, due to the presence of error, we find that per- 
sons with the same content score may have different intensity scores. 
We therefore compute the median intensity for each content rank 
and use the curve of medians as an approximation of the true 
intensity function. To locate each median we determine for each 
content rank the individual who occupies the median position on 
intensity for that content rank and then find out what his intensity 
percentile is in the entire population. This is done by the ordinary 
formula for interpolation for the median of a continuous distribution 
when only class intervals are given. Medians are used, rather than 
arithmetic means or similar averages, because medians are inde- 
pendent of any metric apart from rank order. 

A proper graphic presentation of the curve of medians is to express 
the data not in the crude ranks observed, but in estimates of what 
the rank on the whole attitude universe would be if indefinitely many 
questions had been asked. The percentile metric offers a way of do- 
ing this. The cumulative frequencies for the row (content) and col- 
umn (intensity) marginals are computed at the bottom and to the 
right of the correlation table respectively. The last row of the 
table indicates the estimated median intensity percentile of each 
content column. The percentile metric is used for both content and 
intensity so that the people are considered to be arranged from 0 to 
100 per cent according to their rank on intensity. Each plotted 
point on the graph corresponds to the midpoint of the content inter- 
val and to the intensity percentile of the median case for each con- 
tent interval. 
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In practice, the values for the plotted points are determined quite 
simply. The cumulative percentages are computed for the total 
frequencies of both content and intensity, cumulating from negative 
to positive on content and from low to high on intensity. To deter- 
inine the content value to be plotted we compute the midpoint of 
the interval of percentiles for each content rank. The intensity 
value to be plotted is the median intensity percentile for the content 
rank, as just discussed. 

The procedure for studying the intensity function, thus, is very 
simple. It consists of (1) asking a series of questions with the con- 
tent of interest; (2) testing these questions to determine whether 
they all ask the respondents about the same single issue (this is 
done very simply by means of scalogram boards or any of the other 
techniques for carrying out a scalogram analysis that have come 
out of the work of the Research Branch); (3) obtaining for each re- 
spondent a content score based upon his rank order on the content 
scale (if the area is scalable; otherwise there is no point to the fur- 
ther steps); (4) asking a simple intensity question such as, “How 
strongly do you feel about this?” after each content question in or- 
der to obtain an intensity score for each respondent; and (5) plot- 
ting content scores by intensity scores to obtain a U- or J-shaped 
curve, the lowest point of which serves to divide the population into 
the desired objective and invariant “favorable” and “unfavorable” 
groups. 

This technique can best be illustrated by means of the actual ex- 
amples in the next section. 


Illustrations of the Intensity Function 


For the most part the following examples will deal with the same 
attitude areas used to illustrate the scalogram board technique of 
scale analysis. It will therefore not be necessary to repeat the con- 
tent scale pictures presented previously. For these pictures and 
for the complete wording of the questions asked, the reader can re- 
fer to Chapter 5. The present section will give only the wording of 
the content question itself without listing the answer categories. 
However, the complete wording of the intensity questions and their 
scalograms will be given. These intensity scalograms are con- 
structed and analyzed in exactly the same manner as the content 
scalograms and so no additional explanation need be given. The 
following are the areas of content to be used as illustrations here: 
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Table Universe 

BDe eyetnenddc teens Attitude toward Officers 

DY. | acetals le ike de Attitude toward the Army 

Oe, cituced esaveess Knowledge of Current Events 

By eo euaw enya Attitude toward Administration of 
the Army Score Card Plan 

Ose emia? Sagi: Attitude toward Idea of the Army 
Score Card Plan 

Ds. ste Regis nals 22's Attitude toward the WAC 

Sa ce eee Attitude toward Postwar Conscrip- 
tion 

Se oiiaweaksatveees Satisfaction with One’s Army Job 


These illustrations will serve to show that a U curve can be deter- 
inined empirically from content and intensity scores. However a 
great deal more work must be done on eliminating some of the error 
from these curves. The present research is only a first approach 
toward the measurement of the intensity function. 

Altitude toward officers. Using the technique outlined above, an 
intensity analysis was made for the attitude area of “Enlisted Men’s 
Attitudes toward Their Officers.” A scalogram of this area show- 
ing the questions asked and the scale pattern of responses was given 
in Chapter 5 (Scalogram 2). Each content question was followed 
by a question asking about the intensity of feeling. Scalogram 11 
shows the intensity questions that were asked after each content 
question and the scalogram pattern which resulted. The scale anal- 
ysis and rank order of individuals is determined independently for 
content and intensity, and consequently there is no reason for the 
number of scale types to be the same for the content and intensity 
scales. Since the sensitivity of the intensity curve is a function of 
the number of content and intensity scale scores that can be corre- 
lated, it is advisable in an intensity analysis to keep separate as 
many answer categories as possible so that the number of scale types 
is increased. 

From Scalogram 11 it can be seen that the responses to the in- 
tensity questions form a quasi scale. The meaning of a quasi scale 
has already been discussed in some detail. Since the reproducibility 
of responses from scale scores does not apply to quasi scales, we do 
not compute the coefficient of reproducibility. 

Each of the content and intensity questions were dichotomized 
in the present example. Since there were eleven content questions, 
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and nine of these were followed by intensity questions, we have 
twelve content ranks and ten intensity ranks. Each soldier re- 
ceives two scores: a content score ranging from 0 to 11 depending 
upon the number of questions upon which he holds “positive” atti- 
tudes toward officers, and an intensity score ranging from 0 to 9 
depending upon the number of answers about which he feels 
“strongly.” Tabulating intensity score against content score for 
all soldiers in the entire sample of 2,827 men yields the joint fre- 
quency distribution shown in Table 1b. 


TABLE 1b 


Enuietmp MEN’s ATTITUDES TOWARD THEIR OFFicers 
(Content Scores by Intensity Scores) 


Content rank Cum. 
I Total per 
rank (Neg.) (Pos.)  freg. cent 
2 $8 4 6 @ 7% 8 9 10 I! 
9 (High) 53 76 64 48 35 30 17 21 20 15 5 17 401 100 
8 21 35 40 20 37 20 28 16 15 1217 5 275 86 
7 15 27 44 35 32 20 22 17 18 183 6 4 253 7 
6 10 16 48 35 29 33 30 24 18 1212 6 268 67 
5 10 20 80 86 36 35 26 17 16 1110 1 236 58 
4 6 10 32 39 48 $9 41 21 23 14 5 1 274 49 
3 1 9 31 34 20 34 29 $7 S38 11 5 2 255 40 
2 1 4 12 17 47 52 28 32 31 18 6 1 244 = 31 
1 2— 8 26 82 82 52 37 32 20 4 3 248 22 
0 (Low) 2 8 6 33 32 56 62 65 59 40 9 1 9873 18 
Total frequency 121 205 310 321 352 351 335 287 264 161 79 41 2,827 
Cumulative per 
cent 4 12 22 34 46 59 71 81 90 96 99 100 
Midpoint of con- 
tent percentiles 2 8 17 28 40 53 65 76 85 9397 99 
Median of inten- 


sity percentiles 82 78 66 53 48 40 39 33 33 37 58 79 


If the pure intrinsic intensity function were being measured by 
the present technique, then Table 1b should show intensity as a 
perfect U- or J-shaped function of content. Table 1b shows that 
this is not the case, that there is much error in the technique; but 
despite the considerable amount of error, the essential shape of the 
intensity function is apparent. The italicized frequency in each 
column of Table 1b corresponds to the intensity rank in which the 
intensity of the median case lies for each content interval. 
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The last two rows of Table 1b give the percentiles of the midpoints 
of content and the percentiles of the medians of intensity, which are 
the values plotted in Figure 1. The method of computing these 
two points was described in the previous section. For example,’ 


12a. 


Scalogram No. 11. Intensity of Attitude toward Officers 
Questions and Answer Categories 


Did your officers give you a good chance to ask questions as to the reason why 
things were done the way they were? 


. How strongly do you feel about this? 


1 
2 
3 
4 
0 


Not at all strongly 
Not so strongly 
Fairly strongly 
Very strongly 

No answer 


. How many of your officers took a personal interest in their men? 
. Same as 12b. 


. Do you think that your officers generally did what they could to help you? 
. Same as 12b. 


. How well do you feel that your officers understood your problems and needs? 


Do you feel that your officers recognised your abilities and what you were able to 
do? 


. In general, how good would you say your officers were? 
. Same as 12b. 


. How many of your officers used their rank in ways that seemed unnecessary to 


you? 


. Same as 12b. 


. When you did a particularly good job did you usually get recognition or praise for 


it from your officers? 


. Same as 12b. 


. How much did you personally like your officers? 
. Same as 12b. 


. How did you feel about the officers that had been selected by the Army? 
. Same as 12b. 


. How much did you personally respect your officers? 
. Bame as 12b. 


7 Rounded numbers only are used in this example; for the actual computations, all 
percentages, including cumulative ones, were carried to three or four significant digita. 
Of course, there are other equivalent ways of arranging the arithmetic than that pre- 
sented here. 
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the midpoint of content percentile for content rank 7 is 76, arrived 
at as follows: 


( Cum. % of Cum. % of 
Cum. % of Content Rank 6 + Content Rank 7 5 Content Rank 6 
or, 

mn+oi=u 81 — a_i = 76 


The median of intensity percentile for content rank 7 is 33, arrived 
at as follows: 


Cumulative % of Intensity Rank 2 + 


otal 
Freq. of Cum. Freq. of 
alee ae - (ee: ea) Cum. % o: Cum. % 0: 
an or Intensit; — { Intensi 
2 Content Rank 7 (Intensity ' (Intensity } 
Frequency of Intensity Rank 3 
for Content Rank 7 
287 
=> — 134 
or, 31+ ( 2 (40 — 31) 
37 
= 31+ € (9) 
= 33 


The lowest plotted points fall at 76 and 85 per cent, and the 
bottom of the curve is fairly flat. The exact location of the zero 
point in the case of a flat curve is difficult to determine. However, 
the fact that a large range of “neutrality” exists is in itself an im- 
portant finding. In particular, the region in which the zero point lies 
is invariant. While it may not be possible to ascertain exactly 
where the zero point is, we can say that according to this sample of 
soldiers, 53 per cent of the enlisted men had an unfavorable atti- 
tude toward officers; 7 per cent had a favorable attitude toward 
officers; and the remaining 40 per cent were in between. As more 
and more questions are added to the scale, the size of the zero range 
will become more narrow and the location of the zero point can be 
determined with greater accuracy. 

The fact that the curve in Figure 1 is suspended in mid-air and 
does not reach the bottom percentile of intensity is due to the pres- 
ence of error in the technique for ascertaining intensity. The col- 
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umn medians are being plotted; since there is error, the medians 
are away from where they would be if there were no error. The 
essential shape of the intensity curve that would be obtained if 
there were no error seems rather apparent from Figure 1, and it 
seems safe to assume that the pure curve would actually touch bot- 
tom in a region within the zero interval obtained from the observed 
curve. 
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CONTENT PERCENTILE 
Figure 1. Enlisted men’s attitudes toward their officers. 


A technique for obtaining a single point approximation for the zero 
point is to use the median content percentile of the people lowest on in- 
tensity. In the perfect curve, the lowest person on intensity would 
be at the zero point on content. Since we have error present in 
practice, we can take an average of the content positions of the 
people lowest on intensity as an approximation to the ideal zero 
point. As small a percentage as possible should be used in order to 
avoid distortion, but the sample number of people included must 
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be large enough to be reliable. If we consider only the bottom row 
in Table 1b (intensity rank 0) the median content percentile of the 
373 men therein is, by interpolation, 68 per cent, and we can use 
this as an estimate of the true zero point. This estimate does fall 
in the zero interval just previously obtained from the U curve. 


Scalogram No. 12. Intensity of Attitudes toward the Army 
Questions and Answer Categories 


26a. All things considered, do you think the Army is run about as efficiently as possible, 
or do you think it could be run better? 
b. How strongly do you feel about this? 
1 Not at all strongly 
Not so strongly 
Fairly strongly 
Very strongly 
No answer 


2 
3 
4 


In general do you think the Army has tried its best to see that men get as square a 
deal as possible? 


28a. 
b. Same as 26b. 
20a. 


In general, do you feel you yourself have gotten a square deal from the Army? 
b. Same as 26b. 


30a. Do zou feel that the Army is trying its best to look out for the welfare of enlisted 
men 


b. Same as 26b. 


31a. In general, how interested do you think the Army is in your welfare? 
b. Same as 26b. ‘ 
33a. In the Army, some jobs are naturally harder and more dangerous than others and 
the Army has to put men where it thinks they are needed. 
Considering everything, do you think the Army is trying its best to see that, as 
far - possible, no man gets more than his fair share of the hard and dangerous 
jobs 
b. How strongly do you feel about this? 
6 


Not at all strongly 
7 Not s0 strongly 
8 ______ Fairly strongly 
9 Very strongly 
12 No answer 


34a. Do you think the Army is trying its best to see that the men who have the hard 
and dangerous jobs get the special consideration and breaks they deserve? 
b. Same as 33b. 
35a. On the whole, do you think the Army gives a man a chance to show what he can 
do? 
b. Same as 33b. 
36a. In general, how well do you think the Army is run? 
b. Same as 33b. 
37a. Do you think when you are discharged you will go back to civilian life with a 
favorable or unfavorable attitude toward the Army? 
b. Same as 33b. 
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TABLE 2 


Enuistap Mrn’s ATIrrupDE TOWARD THE ARMY 
(Content Scores by Intensity Scores) 


Content rank 
7 8 9 10 
73 64 50 37 
57 40 20 = 19 
41 33 38 26 
35 35 34 25 
42 86 23 26 
46 37 2&7 #6 
37 29 «.23—«18 
26 25 #28 29 
40 28 20 33 
24 30 38 29 
65 52 47 43 
486 400 357 311 
48 55 65 73 
37 49 60 69 
42 42 40 34 


77 


35 


83 89 


42 43 


Total 
(Pos.) freq. 
14 16 16 
30 22 66 797 
20 19 16 405 
15 17 7 363 
14 #11 4 307 
10 8 5 267 
5 5 2 268 
7 ~#1 1 214 
10 4— 221 
7° 1 — 214 
7 2 2 211 
6 7 3 347 
131 97 96 3,614 
95 97 100 
93 96 99 
56 62 81 
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Attitudes toward the Army. A second illustration of the intensity 
function concerns the area of enlisted men’s attitudes toward the 
Army. The scale analysis of the content questions in this area was 
presented in detail in Chapter 4, ‘The Scalogram Board Technique 
for Scale Analysis.” The scale pattern of the intensity questions 
which followed each of the content questions can be seen from 
Scalogram 12. 
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CONTENT PERCENTILE 
Figure 2. Enlisted men’s attitude toward the Army. 


Once again we find the familiar quasi-scale pattern of intensity. 
Each of the ten intensity questions was dichotomized, producing 
eleven intensity ranks. Several of the content questions were left 
as trichotomies making it possible to differentiate between seven- 
teen meaningful content types. The tabulation of intensity scores 
against content scores is given in Table 2. Figure 2 shows the in- 
tensity curve resulting from the plotting of median intensities. 
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As in the case of attitudes toward officers, this curve is definitely 
skewed. The lowest obtained point falls at 69 per cent, but once 
again the zero range is fairly broad. We can safely say that 37 per 
cent are negative as compared to 11 per cent positive, while 52 per 
cent are in between. 

Knowledge of current events. Scale analysis is useful not only for 
the investigation of attitudes, but for any universe of content which 
is observed as qualitative data. Information tests are important 
sets of qualitative data which require scale analysis if they are to 
be scored meaningfully. 

How are the intensity function and zero point related to infor- 
mation tests? Our customary verbalizations about information 
tests do not seem to consider “positive” and “negative’”’ knowledge. 
Knowledge seems usually to be thought of as nonnegative. That 
this may often be justified is illustrated by the following example. 

The universe of content used was knowledge of facts associated 
with the war. The scalogram for this area is given in Chapter 5, 
Scalogram 6. Intensity was asked by following each content ques- 
tion with a question inquiring as to degree of certainty of the answer. 
Scalogram 13 shows the scale pattern obtained by the responses to 
these intensity questions. 


Scalogram No. 18. Intensity of Knowledge of Current Events 
Questions and Answer Categories 


40a. Who is the Chief of Staff of the U.S. Army? 
b. How sure are you of your answer? 
6 ___ Very sure 
7 ____ Fairly sure 
8 Not sure 
12 No answer 


. Which of the following countries is at war against the Axis? 

. Same as 40b. 

Where waa the first major beach head established by the Allies in their invasion of 
Europe? 

. Same as 40b. 

Who is the recently appointed Secretary of State? 

Same as 40b. 

The man who said we would have “peace in our time” was _____? 
Same as 40b. 

The “Co-Prosperity Sphere” was a scheme of ________? 

Same as 40b. 

“‘Fasciam” began in _______? 

Same as 40b. 
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The tabulation of content scores by intensity scores is given in 
Table 3, while the intensity curve is shown in Figure 3. This curve 
has an interesting shape. It consists of but one arm of the U, and 
every content score may be said to be on the positive side of zero. 
Certainty increases as knowledge increases, while the less one knows, 
the less certain one is of one’s answers. This is in sharp contrast to 
attitudes where both positive and negative answers can be held with 
strong intensity. It would seem that if a person gives the wrong 
answer to an apparently factual question and yet is firmly convinced 
that his answer is right, there is a strong possibility that the question 
is not tapping an informational area but an attitudinal area. 

Additional examples. The shape of the intensity curve can vary 
considerably from area to area. This will be seen in the remainder 
of the examples to be given. However since the reader by now 
should be well acquainted with the scale pictures for the intensity 
function, we will not present the separate scalograms for the remain- 
ing examples. Suffice it to say that in each case content formed a 
scale and intensity proved to be a quasi scale. The tabulation of 
intensity scores by content scores will be presented, together with 
the intensity curves. 

Attitude toward administration of Army score card plan. An inter- 
esting example of intensity analysis is afforded by a study of sol- 
diers’ opinions of the Army’s demobilization score card plan. Two 
areas were to be studied. One was opinion of the idea of the score 
card as a demobilization plan and the other was opinion of how the 
plan was being carried out in practice. Let us consider this second 
area first. Six questions were asked as a sample of the universe of 
content. As in the previous examples, each question was actually 
in two parts; the first part concerned the content, and the second 
part elicited intensity of feeling. One of the questions was, for ex- 
ample: 


(a) In general, do you think the Army is trying its best to carry out the 
Army score card plan as it should be carried out? 

Yea, it is trying its best 

It is trying some, but not hard enough 

It is hardly trying at all 


(b) How strongly do you feel about this? 
Not at all strongly 

Not so strongly 

—__— Fairly strongly 

Very strongly 


TABLE 3 


Know.epex or Current Evants 
(Content Scores by Intensity Scores) 


Content rank 
Intensity Total Cum 
rank (Neg.) (Pos.) — freq. % 
0 1 2 3 4 § 6 7 
7 (High) - =—- = 4 2 6 6 15 32 100 
6 _- — 1 2 2 5 12 7 29 89 
5 _— 1 2 4 4 15 388 9 58 79 
4 1 — 1 4 10 10 8 2 36 60 
3 _- — 6 8 9g 7 3 4 37 47 
2 1 4 6 7 8 7 3 1 36 35 
1 1 8 4 3 5 1 — 1 23 23 
0 (Low) 16 8 6 7 5 3 0o—- = 44 15 
Total frequency 18 21 25 39 45 54 64 39 295 
Cumulative per cent 6 18 22 35 50 68 87 100 
Midpoint of content 
percentiles 3 10 17 28 #438 #459 78 93 
Median of intensity 
percentiles 9 17 29 39 #41 58 71 88 
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Figure 3. Knowledge of current events. 
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Each part a of the remaining five questions asked a different opinion 
question about how the plan was being carried out; each part b was 
identical with the above for all the questions. 

The joint distribution of content and intensity ranks for the atti- 
tude of soldiers on how the score card plan was being carried out is 
given in Table 4, and the approximate intensity function is plotted 
in Figure 4. The expected U curve of intensity on content results 
with the low point of the curve, the zero point, falling between per- 
centiles 28 and 67 of the content scale. This indicates a split among 
the soldiers in their attitude toward the way in which the score card 
plan was being administered, with about half the soldiers having a 
favorable attitude and half having an unfavorable attitude. In 
general, the curve has a rather flat bottom indicating a wide zero 
range or area of indifference. The shape of this curve is quite dif- 
ferent from the positively skewed curves of attitude toward officers 
and attitude toward the Army. , 

Attitude toward idea of Army score card plan. The other aspect 
of the problem, the soldier’s attitude toward the idea of the score 
card plan, produced an altogether differently shaped intensity curve. 
Eight questions, containing the content in part a and the intensity 
in part b, were asked as a sample of all questions that could have 
been asked in this area. 

The joint distribution of content and intensity ranks for soldiers’ 
attitudes toward the idea of the score card plan is given in Table 5, 
and the approximate intensity function is plotted in Figure 5. The 
zero point seems to be indefinitely far to the left. This indicates 
that there was little or no unfavorable opinion about the idea of the 
score card plan—practically everybody was favorable. Such a con- 
clusion could not be ascertained by looking at the marginal fre- 
quencies of the individual questions asked in this area. For ex- 
ample, one of the questions was: “In general, what do you think of 
the Army score card plan (the point system)?” The responses 
were as follows: 


It is very good 23% 
It is fairly good 49 
It is not so good 16 
It is not good at all 7 
Undecided 5 


100% 


It is important to notice that the technique of single point esti- 
mation of the zero point by using the content median of only the 


TABLE 4 


Sotpme OPrminion oF THB ADMINISTBATION OF THE Scors Carp 


(Content Scores by Intensity Scores) 
Content rank Cum. 
Intensity SEE ANE Total per 
rank (Neg.) (Pos.) freq. cent 
0 1 2 3 $ 6 6 
9 (High) 65 50 42 22 38 33 81 823 100 
8 44 86 70 40 45 652 6&7 374 90 
7 39 55 84 6 59 63 44 404 79 
6 16 67 98 85 84 85 25 460 66 
5 17 42 «98 114 1128 91 28 497 52 
4 12 34 180 128 157 157 Il 629 37 
3 10 21 64 82 70 19 1 267 17 
2 —- 8 46 4 28 9 1 138 9 
1 4 8 19 28 18 6 1 79 5 
0 (Low) 4 18 28 16 7 4 — 71 2 
Total frequency 211 369 669 620 605 519 249 3,242 
Cumulative per cent 7 #18 #39 5&8 76 92 100 
Midpoint of content 
percentiles 3 12 28 48 67 8& 96 
Median of intensity 
percentiles 79 #63 «+46 #38 #40 #47~«#81 
too 100 
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Figure 4. Soldier opinion of the administration of the score card. 


TABLE 5 


Sotpmn Opinion oF THE IpEa oF THE ScorB Cagp 
(Content Scores by Intensity Scores) 


Content rank 


6 18 46 73 32 19 30 84% 19 8 279 77 
5 13 54 102 32 36 S81 10 10 4 292 69 
4 19 55 113 67 S88& 28 18 5 — 343 60 
3 20 91 144 84 42 23 #11 3 2 420 49 
2 16 80 157 66 29 22 7 #3 #3 882 36 
1 32 119 177 70 27 8 6 2 — 441 24 
0 (Low) 69 134 80 37 15 10 3 2 — 35:0 i 

Total frequency 227 699 981 441 239 210 169 143 133 3,242 

Cumulative per cent 7 29 589 %72 80 86 91 96 100 

Midpoint of content 

percentiles 4 18 44 66 76 88 89 94 98 
Median of intensity 
percentiles 34 89 43 44 #51 64 76 83 91 
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Figure 5. Soldier opinion of the idea of the score card. 
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lowest intensity group breaks down in this instance. Using the 
technique of lowest intensity is not safe if the intensity function is 
too asymmetric. 

Altitude toward the WAC. An example of a very sharp intensity 
curve is afforded by the universe of soldiers’ attitudes toward the 
WAC. The content scalogram and the sample of questions used 
are given in Chapter 5, Scalogram 3. The tabulation of intensity 
scores against content scores is given in Table 6, while the intensity 
curve is presented in Figure 6. 

From this curve we see that there is a very definite zero point at 
75 per cent. The sharpness of this curve would indicate that there 
are few soldiers who are “neutral” in their attitudes toward the 
WAC, and that the ratio of approval to disapproval is approxi- 
mately 1 to 3. Very sharp U curves indicate a clear distinction 
between being positive and being negative. 

Attitude toward postwar conscription. Another example of a fairly 
sharp U curve is found in the area of soldiers’ attitudes toward post- 
war conscription. The content of this area can be seen from Scalo- 
gram 4, Chapter 5. Table 7 and Figure 7 give the tabulation and 
graph of content scores by intensity scores. 

The low point of this intensity curve is in the negative region of 
the content continuum, showing more soldiers in favor of postwar 
universal military training than opposed to such training. 

Satisfaction with one’s job. A final example of an extremely flat- 
bottomed intensity curve is that of soldiers’ attitudes toward their 
jobs. The content scalogram and questions asked in this area are 
given in Chapter 5, Scalogram 1. The tabulation of content by in- 
tensity scores is given in Table 8, while the intensity curve is drawn 
in Figure 8. 

There is a wide range of indifference running from the 34th per- 
centile to the 77th percentile. One conclusion from this curve 
would be that soldiers are not sharply divided in regard to satisfac- 
tion with their jobs. There is an appreciably large group that can 
be labeled as neither satisfied nor dissatisfied. 


Relativity of the Intensity Function 
As in the case of content scale analysis, the intensity function is 
relative to the population and the time of the study. This is to be 
expected since the intensity function is as intrinsic a part of the atti- 


tude universe as the content scale. For a detailed discussion of the 
relativity of an attitude universe, the reader is referred to Chapter 5. 


TABLE 6 


Enuistep Maun’s ATTITUDE TOWARD THE WAC 
(Content Scores by Intensity Scores) 


Content rank 
Intensity SU Total Cum. 
rank (Neg.) (Pos.) dreq. % 
0 1 2 8s 4 & 6 
6 (High) 98 78 47 #21 21 «#18 «2 292 100 
5 37 80 34 21 21 6 9 178 83 
4 17 60 46 22 28 il 10 184 73 
3 26 27 6&6 3 36 18 «+16 221 62 
2 4 22 6 60 6 7 18 256 49 
1 4 20 48 45 123 34 10 284 35 
0 (Low) 3 12 61 59 146 8 4 815 18 
Total frequency 183 259 866 267 435 134 86 1,730 
Cumulative per cent 11 26 47 62 87 95 100 
Midpoint of content ; 
percentiles 5 18 36 54 75 91 98 
Median of intensity 
percentiles 8 73 51 42 2 36+ 69 
1 100 
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Figure 6. Enlisted men’s attitude toward the WAC. 


TABLE 7 
ATTITUDE TOoWakD PosrwaR CONSCRIPTION 


(Content Scores by Intensity Scores) 
Content rank Cum. 
Intensity _—_—_— Total per 
rank (Neg.) (Pos.) freq. cent 
0 1 2 3 4 7 6 
5 (High) 60 42 23 19 84 55 174 407 100 
42 39 #17 «17)«387)— O58 OS 274 77 
3 24 38 %2 35 5&4 68 26 269 61 
2 2 47 8 4 60 37 12 252 47 
1 4 587 61 6&& 61 £422 5 252 33 
0 (Low) 12 6 69 121 49 9 2 822 18 
Total frequency 162 284 218 200 295 234 283 1,766 
Cumulative per cent 9 25 88 54 71 84 100 
Midpoint of content 
percentiles 5 17 81 46 62 77 92 
Median of intensity 
percentiles 69 40 20 2 41 © 81 
100 100 
(High) 
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Figure 7. Attitude toward postwar conscription. 


TABLE 8 


SatTisracTion wiTtH Onz’s Army Jos 
(Content Scores by Intensity Scores) 


Content rank 


Oo 1 #2 8 £ 6 6 7 8 910 II 


8 (High) 28 46 27 33 22 19 24 20 9 2523 2&4 
7 24 81 26 38 31 22 138 20 2116 5& 
6 1 7 #9 17 30 24 35 17 23 1611 — 209 70 
5 6 14 14 29 20 34 27 18 17 910 — 
4 2 3 15 17 8&8 3 36 19 20 10 1 — 
3 — 1 17 19 22 #7 33 18 7 116:1— 
2 1 4 9 19 25 34 31 25 14 #1 4 «#1 
— 2 2 12 35 39 38 23 13 8— 1 
— 8 7 12 29 43 33 21 15 31 — 


Total frequency 40 104 151 184 248 286 279 169 138 103 66 31 1,799 
Cumulative percent 2 8 16 27 40 56 72 81 89 95 98 100 


Midpoint of content 
percentiles 1 65 12 22 34 48 64 77 85 9296 99 
Median of intensity 
percentiles 86 80 63 52 41 37 38 36 47 6674 89 
100 100 
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Figure 8. Satisfaction with one’s Army job, 
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In regard to subpopulations of the total population, the intensity 
function can serve the useful purpose of permitting comparisons be- 
tween groups as to absolute percentages of various groups that are 
positive or negative. For example, with the aid of an intensity 
analysis one can say not only that privates are relatively less satis- 
fied with their jobs than noncoms, but also that there are more pri- 
vates than noncoms who are dissatisfied. This type of absolute 
comparison adds important information to our understanding of 
how groups differ. 

The comparison of percentages of a population who are positive 
and negative can also be made between different attitude or opinion 
areas. Thus, for example, with an intensity analysis one could 
compare the number of men who have favorable attitudes toward 
the British with the number of men who have favorable attitudes 
toward the Russians. 

Several examples of an intensity analysis for subgroups of a popu- 
lation are given below. From Figures 9 and 10, based on Tables 9 
and 10, it is possible to conclude that more noncoms than privates 
have negative attitudes toward the WAC. Both the zero range 
and the zero point of the noncoms are more to the right than those 
of the privates. 

Figures 11 and 12, based on Tables 11 and 12, compare privates 
and noncoms in regard to satisfaction with their job. It is quite 
clear from these curves that more privates than noncoms are dis- 
satisfied with their jobs. While one could conclude from a com- 
parison of content scale scores alone that privates are less satisfied 
than noncoms, the present comparison permits a direct statement 
of the absolute percentages dissatisfied in both groups. A striking 
comparison is given by the intensity function computed separately 
for men who have completed high school as civilians but who are 
only privates in rank. The zero point score falls at 89 per cent, 
showing a high degree of dissatisfaction. These results are shown 
in Table 13 and Figure 13. 

An interesting illustration of the ability of the intensity function 
to discriminate between groups is shown in Figures 14 and 15, 
based on Tables 14 and 15. The group of men who do not believe 
that the Army Score Card Plan would be carried out in practice the 
way it was supposed to be is seen to contain many more men with 
negative attitudes toward the Army in general, than the group of 
men who believe that the Army Score Card Plan would be carried 
out as-promised. 


TABLE 9: Arrrrups Ttowakp THES WAC—ParivarTss 


(Content Scores by Intensity Scores) 
Content rank 
Intensity Total Cum, 
rank (Neg.) (Pos.) freq o 
0 1 2 8 4 & 6 7 8 
15 (High) Ro ae pe eee a BS 9 100 
14 _ 6 4—-> — 1—- 1 — 12 98 
13 1 4 8 2 1 1 3 1— 21 96 
12 1 & 10 7 3 2 3 4— 35 91 
11 _- 2 10 2 4 4 4 z2— 28 84 
10 — 4h #6 8 7 8 6 Jf 4 78 
9 —- — 10 123 7 8 8 & — 44 O68 
8 _ 1 6 18 10 8 5 6 — 49 59 
7 _-_ _ 1 10 Wd 12 6 4 — “4 49 
6 _ 1 2 8 8 16 8 3 1 47 39 
5 —- 1—- 9 6 & 6 1 — 47 = 30 
4 —- —- —- 4 6 ® 6— — 45 2 
3 - - = 8 4 17 ie- — 25 10 
2 —- —- — 1 $ #7 8 — — 14 5 
1 - ell rl 2 3 1- — 6 2 
0 (lew) - - - -—- — 2 i- — 8 1 
Total frequency 7 26 62 77 «#+%7 187 #§&8 37 #=+S 474 
Cumulative per cent 1 7 2 36 51 92 99 100 
Midpoint of content 
percentiles 1 4 14 28 44 66 86 95 100 
Median of intensity 
percentiles 99 90 78 §& 4 24 40 @& 73 
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Figure 9, Attitude toward the WAC—privates 


TABLE 10: Arrrrups towarp THs WAC—Noncoms 


(Content Scores by Intensity Scores) 


Content rank 
Intensity Total Cum. 
rank (Neg.) (Pos.) freq. % 
0 1 5 J $s 4 6 6 7 
15 (High) a ee ee ee) ae 
14 3 10 4- -— 1- — 1 19 97 
13 _ & 16 1- —- — 1 1 24 92 
12 1 7 9 8 3 1 1 2 1 31 87 
11 _ 4 16 15 4 2—- — 1 41 80 
10 —_ 2 5 13 5 4 2 2 1 34 70 
9 _ 2 7 18 5 2 1 4-—_ 39 62 
8 — 2 6 WH 8 6 —- — 44 «53 
7 —- —- —- 8 8 F 2 3 1 80048 
6 —- — 2 146 7 8 £; — — 48 36 
i] -_-_ =- 2 10 6 18 s;—- — 40 25 
4 - 1 7 6 22 6 —- — 41 16 
3 - - - 2 3 5 2—_—> — 12 7 
2 — 111 4 7 1—> — #15 4 
1 oe, Seo ee eS, es ee NE he ayy te 
0 (Low) ee eee eee ee ee 2 — 
Total frequency 12 36 69 108 63 98 27 12 9 4384 
Cumulative per cent 8 11 27 52 66 89 95 98 100 
Midpoint of content 
percentiles 1 7 #19 #39 #4589 78 92 96 99 
Median of intensity 
percentiles 98 87 76 538 41 28 60 «90 
100 100 
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Figure 10. Attitude toward the WAC—noncoms. 


TABLE 11 


SaTIsFACTION WITH ONE’s JoB—PRIvATES 
(Content Scores by Intensity Scores) 


Content rank 
Intensity —_—— Total Cum. 
rank (Neg.) (Pos.) freg. % 
0-1 «8 $8 4 & 6 7 8 9-12 
8 (High) 39 #10 «613~«(o11)0S 10:8 6 38 19 124 100 
7 $1 12 IL 23 @ ) 4 9 9 118 86 
6 6 16 18 #18 #15 = «25 8 8 7 116 72 
5 10 8 17 9 2 18 7 4 5 98 59 
4 2 6 8 19 #1 18 6 6 4 90 47 
3 1 13 7 #13 #15 = «#18 8 2 3 80 37 
2 1 8 9 138 22 16 «12 8 — 84 28 
1 1 1 56 2 2 18 12 5 2 84 18 
0 (Low) 3 3 6 10 22 12 7 6 2 71 8 
Total frequency 84 77 88 136 165 147 70 46 51 865 
Cumulative percent 10 19 20 45 64 81 89 94 100 
Midpoint of content 
percentiles & 14 24 37 64 72 85 91 97 
Median of intensity 
percentiles 84 58 54 43 38 42 32 580 76 
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Figure 11, Satisfaction with one’s job—privates. 


Total frequency 
Cumulative per cent 
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percentiles 
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TABLE 12 


SATISFACTION WITH ONE’s JoB—NoNCOMS 
(Content Scores by Intensity Scores) 


Content rank 


x) 
- 
(—] 
_ 
oO 
oo 
= 
> 
wo 


57 69 79 92 


82 67 651 38 35 34 38 47 72 


INTENSITY PERCENTILE 
3 8 


i} 


g 


) 


0 


(Neg) 


20 40 €0 80 
CONTENT PERCENTILE 


Figure 12. Satisfaction with one’s job—noncoms. 


TABLE 13 


SaTIsFacTION witH ONE’s Jop—Epvucatep Privatzs 
(Content Scores by Intensity Scores) 


Content rank 
Intensity Total Cum. 
rank (Neg.) (Pos.) freqa. % 
0-1 2 8 4 6 6 7-8 9-12 
8 (High) 12 6 8 3 8 — 2 8 32 100 
7 7 38 38 6 4 8 4 8 3387 
6 8 6 6 5 7 5 4 2 88 74 
5 2 2 6 2 4 9 5 1 31 59 
4 — 4 4 3 9 6 1 — 26 46 
3 _ 1 2 2 4 4 8 1 17 36 
2 1 383 8 4 7 8 6 — 27 4=29 
1 1 1 1 3 6 7 6 — 24 18 
0 (Low) _- = 2 3 6 5 6 — 21 8 
Total frequency 26 26 30 31 #50 41 35 10 249 
Cumulative per cent 10 21 83 45 65 82 96 100 
Midpoint of content 
percentiles 5 16 27 39 55 74 89 98 
Median of intensity 
percentiles 85 64 52 49 38 39 32 78 
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Figure 13. Satisfaction with one’s job—educated privates. 
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The validity of the intensity function for analytical problems re- 
mains a field for future research. How much more the zero point 
can contribute to our knowledge of attitudes and opinions and how 
useful it can be for purposes of analysis has yet to be investigated. 
Briefly, we can say that flatbottom U-shaped curves indicate that 
there is not much difference between pros and cons; except for those 
relatively few people who are very extreme on either side, most of 
the people are relatively indifferent. Very sharp V curves indicate 
a clear distinction between being positive and being negative. 

The intensity function does appear to offer a promising solution 
to the problem of obtaining a meaningful cutting point in a universe 
of content, such that those persons whose rank order is on one side 
of the cutting point can be said to be positive in their expression of 
attitude and people on the other side negative. It requires no a 
priori judgment, is independent of the sample of questions used, of 
the way they are worded and the like, and is completely objective. 


Procedure for Intensity Measurement 


A great deal of experimentation with different methods of asking 
intensity of feeling has indicated that intensity can be measured by 
asking after each attitude question about strength of feeling (i.e., 
“How strongly do you feel about this?”) or degree of certainty 
(i.e., “How sure are you of your answer?’’) or difficulty of choice 
(i.e., “How hard was it for you to answer this question?’’), or by 
mixing all three forms of the intensity question. Matched groups, 
answering the same content questions but different intensity ques- 
tions, show no significant differences in the intensity curve or the 
zero range. ll forms of the intensity question produce quasi scales 
of intensity. 


Determining Both Content and Intensity by the Same Question 


The method of using two parts to a question to determine content 
and intensity involves almost doubling the part of the questionnaire 
devoted to a particular area. It would be desirable, if possible, to 
devise a means of determining an intensity function without in- 
creasing the length of the questionnaire. It so happens that other 
research workers have often combined the intensity with the content 
without distinguishing between the two. In the Likert form of 
phrasing questions, for example, a declarative statement is followed 


TABLE 14 


ATTITUDE TOWARD THE ARMY 
Question: “When the Army score card plan gets going, do you think it will 
really be carried out in the way it is supposed to be?” 
For those men who answer ‘‘Yes” 
(Content Scores by Intensity Scores) 


Content rank 
Intensity Total Cum 
rank (Neg.) (Pos.) freq. % 
0 1 2 8 4 6 6 7 
7 (High) 33 44 42 60 389 43 37 82 380 100 
1 80 33 31 29 37 37 = 33 242 76 


0 (Low) 138 21 41 30 38 46 = 10 202 13 
Total frequency 67 140 184 293 220 245 233 176 1,558 
Cumulative per cent 4 13 25 44 688 74 88 100 
Midpoint of content 
percentiles 2 9 19 3% 651 6 81 94 
Median of intensity 
percentiles 75 62 50 44 40 41 46 738 
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Figure 14. Attitude toward the Army (those who answered “yes” to the question). 


TABLE 15 


ATTITUDE TOWARD THE ARMY 
Qursrion: “When the Army score card plan gets going, do you think it will 
really be carried out in the way it is supposed to be?” 
For thoge men who answer “No” 
(Content Scores by Intensity Scores) 


Content rank 
Intensity Total Cum. 
rank (Neg.) (Pos.) freq. % 
0 I 2 8 4 6 6 7 
7 igh) 126 124 72 49 34 #413 «#16~=« «12 446 100 
6 42 6&1 44 35 18 9 7 9 215 67 
5 13 30 36 2 19 #11 3 — 132 51 
4 12 30 36 8s i117 9 9 7 153 41 
3 10 25 25 27 #12 «11 7 2 119 29 
2 3 14 #15 13 += 26 7 5 1 84 21 
1 2 138 21 12 «14 9 6 1 78 14 
0 (Low) 8 19 15 26 23 = 15 6 1 113 8 
Total frequency 216 306 264 215 168 84 59 33 = 41,340 
Cumulative per cent 146 39 «©6898 «€©750—CO87—ss«938s—s«éB-—s«100 
Midpoint of content 
percentiles 8 28 49 67 81 9 95 99 
Median of intensity 
percentiles 71 #58 46 #40 34 209 36 59 
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Figure 15. Attitude toward the Army (those who answered “no” to the question). 
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by an intensity check list. According to this, an attitude question 
would be put as follows: 


Do you agree or disagree with the following statement: 


Being a WAC is bad for a girl’s reputation. 
(Check one) 
——__.. Strongly agree 


Undecided 


isagree 
—___— Strongly disagree 


Such a form contains both the content and intensity aspect simul- 
taneously. If these can now be separated empirically, keeping the 
question in the same form, we could obtain the desired two scores— 
content and intensity—without lengthening the questionnaire.® 

Using the universe of attitude toward the WAC, a series of ques- 
tions containing five-part answer categories was asked. These 
questions were also followed by the intensity question, ‘““How sure 
are you of your answer?” in order to permit a direct comparison of 
the two different methods of measuring intensity. To obtain the 
content score, the questions were scaled in the ordinary fashion, 
using all five categories of the question. To obtain an intensity 
score, the answer categories were “folded over,” that is, the most 
extreme “positive” category was combined with the most extreme 
“negative” category to denote strong intensity. The middle three 
categories were combined to signify mild intensity. In this way 
the respondent could be given an intensity score for each item. 

In addition to this intensity score determined by “folding over’’ 
a single question, the respondent was given an intensity score based 
upon the separate intensity questions. Finally he was given a third 
intensity score based upon both his “fold-over”’ score and his in- 
tensity score from the separate intensity question. These three 

* An immediate objection to such an attempt is, of course, that obtaining two scores 
from the same question implies that the scores are not experimentally independent. 
Using the same data twice may introduce some form of spuriousness, because experi- 
mental errors will be correlated in the two scores. 

*The question might be asked, ‘““‘What effect does the following up of each content 
question with an intensity question have upon the responses to the content question?” 
To investigate this problem, the area of attitude toward officers was studied. In one 
questionnaire the content questions were asked, as previously described, with a part b 
of intensity following each content question. In the second questionnaire only the con- 
tent questions were asked. The reproducibility of the content scales in both cases was 
about .87. The scalogram pictures of both were very similar. The marginals on the 
questions were then, of course, also quite similar, indicating that the inclusion of the 
part b on intensity did not affect the men’s answers to the part a on content. 
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intensity scores were then cross tabulated with his content scale 
score. 

Figures 16, 17, and 18 (based on Tables 16, 17, and 18) show the 
results obtained by these three methods. The intensity curves and 
the zero points coincide alinost perfectly. According to all three 
techniques the ratio of unfavorable to favorable opinion about the 
WAC is 8 to 1. 

From this result it would appear that it is possible to measure 
intensity of feeling along with direction of content in a single question. 
However, additional experiments in the universes of attitude toward 
officers and satisfaction with one’s job failed to produce the desired 
intensity curves by means of the “fold-over”’ technique, although 
these curves were obtained by means of a separate intensity question. 
While further research may serve to improve the measurement of 
content and intensity by means of a single question, for the present 
it would seem safer to use separate intensity questions. This latter 
method has the advantage of calling for two distinct decisions from 
the respondent in regard to content and intensity separately and 
thus permitting the correlation of two independently determined 
scores. 

Generalized Intensity 

The technique of asking ‘(How strongly do you feel about this?” 
after each question is admittedly a crude one and accounts for much 
of the error in the observed relationship between intensity and con- 
tent. If the intrinsic intensity could be ascertained, the relation- 
ship should be perfect. How error is introduced by the technique 
just described can be shown in several ways. Some men would say 
“undecided” to a question. When asked part b about strength of 
feeling, they would say they felt ‘very strongly.”” When they were 
asked why they said ‘‘very strongly” to part b when they were unde- 
cided on part a, they would answer to the effect that by “very 
strongly” they meant that the problem was very important, or else 
that they were very sure that they were undecided, or some such 
thing. Thus, many of the responses were out of context and con- 
tributed to error in the frequency distribution. 

Even if all the responses were in context, there is still a contri- 
bution to error from the fact that the degrees of meaning of the 
words vary from subgroup to subgroup of the population. Verbal 
habits of people are considerably different. Some people will say 
“strongly agree” to almost anything when they are in favor of it, 


TABLE 16 


ATTITUDE TOWARD THE WAC 


Content and Intensity Measured by Two Separate Questions 
(Content Scores by Intensity Scores) 


Content rank 
Intensity Total Cum 
rank (Neg.) (Pos.) freq % 
1 2@ 8 4 & 6 
6 (High) 29 63 #78 47 #21 21 «#418 #12 8 292 100 
5 8 2 80 3 21 21 6 7 2 178 83 
4 — 17 60 4 2 28 il 7 3 184 73 
3 3 23 27 #6 39 36 13 16 — 221 62 
2 2 2 22 6 60 60 #87 17 1 256 49 
1 _ 4 20 48 45 128 34 9 1 284 35 
0 (Low) _ 3 12 61 59 146 3 3 1 316 18 
Total frequency 42 141 259 366 267 #435 134 70 16 = 1,730 
Cumulative per cent 2 11 26 47 «62 87 95 99 100 
Midpoint of content 
percentiles 1 7 18 36 54 75 91 97 #100 
Median of intensity 
, percentiles 88 80 73 51 42 28 36 55 88 
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Figure 16. Attitude toward the WAC (content and intensity measured by two 
separate questions). 


TABLE 17 


ATTITUDE TOWARD THE WAC 


Content and Intensity Measured by the Same Question 
(Content Scores by Intensity Scores) 


Content rank 
Intensity ——_—_-—— — —._ Total Cum. 
rank (Neg.) (Pos.)  freg. % 
o 1 2 8 £ 6 6 7 8 
6 (High) 2 13 —- — 1 2 1 — 2 46 100 
5 11 67 19 1 2 3 1 5 4 113 97 
4 1 48 94 26 10 7 9 15 6 216 91 
3 1 10 110 104 38 12 29 #2 2 323 78 
2 _ 1 29 167 76 64 32 20 2 381 60 
1 2 1 4 72 94 177 28 8 ~— 386 38 
0 (Low) — 1 3 6 5117 3 — — 25 15 
Total frequency 42 141 259 366 267 4385 1384 70 16 1,730 
Cumulative per cent 2 411 26 47 #62 87 995 99 100 
Midpoint of content 
percentiles 1 7 #18 36 54 75 91 97 100 
Median of intensity 
percentiles 98 92 76 52 35 21 41 66 = 87 
100; ireze) 
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Figure 17. Attitude toward the WAC (content and intensity measured by the same 
question). 


(Neg) 
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where other people would say ‘‘agree’’ under the same circum- 
stances. Especially with respect to the intensity questions, there 
are people who say ‘‘very strongly” to every question. This tend- 
ency to use or not to use strong adjectives we shall call “generalized 
verbal intensity.’”’ That such a concept exists as a quasi scale for 
the Army population has been shown in several surveys. 


TABLE 18 


ArrrropE towaRp THE WAC 
Intensity Measured by Both Single and Separate Questions 
(Content Scores by Intensity Scores) 


Content rank 
Intensity Total Cum. 
rank (Neg.) (Pos.) freg. ”o 
0 1 2 8 4 6 6 7 8 
15 (High) 38 138O—- 1 1 4 47 100 
14 8 388 #16 #1 1 2— 1 2 64 «(87 
13 1 St 4 11 2 8 585 j%44 $$ 1038 94 
12 5 20 8 2 10 5&§ 7 10 # 121 8&8 
il — 13 48 #38 14 10 6 6 2 £1837 81 
10 8 10 30 28 16 21 9 9 2 128 7 
9 1 6 28 49 2 18 #7 #146 — 1383 65 
8 —- 4 2 48 4 3 18 8 — 180 58 
7 — 2 12 8 ss 41 1 9 1 161 47 
6 — 2 10 48 31 57 18 5 1 172 38 
5 —- 1 4 4 389 70 18 2 — i171 29 
4 1 — 83 18 21 8 14 — — 14 £19 
3 - _ 1 #13 #15 8 9 —_—- — 88 10 
2 —- 1 2 5 15 2 6 — — 58 5 
1 _- 2 1 8 11 z2—- — 24 2 
0 (Low) — 1 1-—- 2 631314-—- = 11 1 
Total frequency 42 141 259 366 267 435 1384 70 16 1,730 
Cumulative percent 2 11 26 47 62 87 95 99 100 
Midpoint of content 
percentiles 1 7 #18 36 5&4 75 91 97 100 
Median of intensity 
percentiles 98 90 75 52 39 24 40 63 88 


Generalized verbal intensity can be measured by studying several 
different universes of content (attitudes) in a single survey. One 
part b question on intensity is selected from each of the different 
areas at random. Scalogram 14 shows the different content ques- 
tions included and the pattern of intensity responses to these ques- 
tions. From this scalogram it can be seen that the eleven random 
intensity questions form a quasi scale. It is therefore possible to 
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rank mdividuals along a scale dimension of what we have called 
“generalized verbal intensity.” 

The manner in which this generalized verbal intensity contributes 
to the errors in the present method of measuring intensity can be 
clearly demonstrated by the following procedure. Respondents are 
divided into three approximately equal groups depending upon their 
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Figure 18. Attitude toward the WAC (intensity measured by both single and sep- 
arate questions). 


position on the scale of generalized verbal intensity. Those persons 
in the upper third are individuals who tend to give strong inten- 
sity answers, those in the middle third tend to give moderate in- 
tensity answers, while those in the lower third tend to give weak 
intensity answers. Using the area of attitude toward the WAC, 
we then compute separate cross tabulations of content scores by 
intensity scores for each of these three generalized intensity groups. 


Category order 


Scarogram No. 14. Genppalizep VERBAL INTENSITY 


Respondent order 
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Scalogram No. 14. Generalized Verbal Intensity 
Questions and Answer Categories 


lla. Considering their responsibilities, how do you feel about the privileges that officers 
get compared with those which enlisted men get? 
b. How strongly do you feel about this? 
1 Not strongly at all 


2 Not so strongly 
3 Fairly strongly 
4 Very strongly 
0 No answer 
12a. On the whole, how is your morale? 
b. Same as 11b. 


18a. All things considered, do you think the Army is run about as well as possible, or do 
you think it could be run better? 


b. Same as 11b. 
15a. In general, do you feel you yourself have gotten a square deal from the Army? 
b. Same as 11b. 
18a. On the whole, do you think the Army gives a man a chance to show what he can do? 
b. Same as 11b. 
20a. What do you think of discipline in the Army? 
b. Same as 1b. 
23a. Did your officers use their rank in ways that seemed unnecessary to you? 
b. Same as 11b. 
26a. The Army has set up an Inspector-General system (the IG or Air Inspector) to 
handle complaints from enlisted men. How well do you think this system actu- 
ally works? 
b. Same as 11b. 
30a. In your opinion, is the Army Score Card (the point system) being carried out the 
way it was supposed to be? 
b. How strongly do you feel about this? 
1 Very strongly 
2 Fairly strongly 
3 Not strongly 
0 No answer 


34a. Here are some statements about Army rules and regulations and their enforce- 
ment. For each statement, check the answer that best shows how closely you 


agree with it. 
A person should obey only those Army rules and regulations which seem reason- 
able. 
b. How hard was it for you to answer this question? 
6 Very hard 
7 —__ Fairly hard 
8 _____ Not ao hard 
9 Not hard at all 
12 No answer 


46a. With which one of these statements concerning postwar relations with Russia do 
you come closest to agreeing? 
b. How strongly do you feel about this? 


6 Not at all strongly 
7 Not so strongly 
8 Fairly strongly 
9 Very strongly 
12 No answer 
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The results of these cross tabulations are shown in Tables 19, 20, 
and 21. By looking at the location of the medians of intensity for 
each content rank, we can see quite clearly why there is so much 
spread in the combined cross tabulation of all three intensity groups. 
The curve of medians for persons with strong generalized intensity 
(Table 19) is concentrated in the upper intensity ranks of the spe- 


TABLE 19 
ATTITUDE TOWARD THB WAC 
High Generalized Intensity 
(Content Scores by Intensity Scores) 
Content rank 
Intensity Total Cum. 
rank (Neg.) (Pos.) freq. % 
2 $8 4 6 
15 (High) ol —- — — — — 1 #4 ~~ 86 100 
14 1 81 #144¢=— 1 2— 1 2 52 93 
13 — 16 37 9 1 1 5 3 62 74 83 
12 2 8 26 21 10 1 5) 8 1 81 69 
1l _— 1 18 19 11 3 38.6 68 1 69 = 3 
10 1 — 9 2 8s 146 82 2 — 49 42 
9 1 1 6 138 6 6 # 5 — 39 0 33 
8 —_ 1 1 11 10 18 38 $= 42 25 
7 -_-_ — 1 8 8 6 4 1— 28 8617 
6 —- — 2 7 6 6 8 — — 24 «612 
5 - -—- — 2 7 #7 i1i—- — 17 7 
4 1-—- — 2 2 6 1- — 12 4 
3 - —- — — 2 2 —- — — 4 1 
2 —~—~——-——-— 2 din 3 1 
1 eee ee ee ee Se a | So. Sis 
0 (Low) —- -—- e—- er re re KS - — 
Total frequency 26 69 118 104 72 69 30 27 10 = 520 
Cumulative per cent 5 18 40 6 74 87 93 98 100 
Midpoint of content 
percentiles 3 12 29 650 67 80 90 95 
Median of intensity 
percentiles 0 86 65 40 26 21 33 «6&1 


cific area being studied—attitude toward the WAC. The curve of 
medians for persons with weak generalized intensity is located in 
the lower intensity ranks, while the curve of medians for persons 
with moderate generalized intensity falls within the middle intensity 
ranks. Combining all three generalized intensity groups for this 
attitude area serves to spread out the intensity scores through all 
intensity ranks, although the median intensity score for any single 
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content rank retains its relative position to median intensity scores 
for the other content ranks. This spread of cases in intensity ranks 
other than the rank of median intensity is what we call errors of in- 
tensity measurement. 

It is significant to note that computing content and intensity per- 
centiles and drawing the intensity curve for each generalized inten- 
sity group separately (Figures 19, 20, and 21) takes into account the 


TABLE 20 
ATTITUDE TOWARD THB WAC 
Medium Generalised Intensity 
(Content Scores by Intensity Scores) 
Content rank Cum. 
Intensity Total per 
rank (Neg.) (Pos.) —freg cont 
1 $8 4 & 6 7. 
15 (High) 8s 2-—--—--—- = 1 — 11 100 
14 2 6 2 1- -- — 11 98 
13 — il 9 2 1 2— 1 26 96 
12 2 7 16 6 — 4 2 1 38 92 
il _— 7 16 #16 8 6 2 4 53 86 
10 1 8 16 Il 6 4 5 4 55 77 
9 _ 4 19 27 8 5 38 6 72 68 
8 — 2 12 8 2 i! 8 1 81 56 
7 —- — 989 W @ 21 4 6 73 43 
6 - - 4 19 12 9 8 — 65 31 
5 _ 1 2 13 #10 15 6 1 48 20 
4 - - = 7 #11 = «19 2—-—> 39 12 
3 -— oC 4 6 9 4 — 23 6 
2 - - —- — 2 5 1 — 8 2 
1 - - —~ — 2 1 1— 4 1 
0 (Low) —-— —- - re Or hc i— 1 _— 
Total frequency 13 48 105 147 99 124 48 24 608 
Cumulative per cent 2 10 27 5&1 68 88 96 100 
Midpoint of content 
percentiles 1 6 19 39 6 78 92 98 
Median of intensity 
percentiles 99 88 72 36 27 84 64 


‘“‘bias” of generalized intensity and produces almost exactly the 
same U curve and zero point for all three groups. The similarity 
of the intensity function for the different generalized intensity 
groups indicates the important property of the invariance of the 
intensity function. The above results were also obtained in other 
attitude universes studied separately for the different generalized 
verbal intensity groups. 
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Generalized Verbal Intensity and the Single Question 


It was also found that responses to individual questions were re- 
lated to this generalized intensity. This finding emphasizes all the 
more that responses to any single content question must be regarded 
with caution and reemphasizes the need for an objective technique 
for obtaining a zero point such as that described in this chapter. 


TABLE 21 
ATrrropE TOWARD THE WAC 
Low Generalized Intensity 
(Content Scores by Intensity Scores) 
Content rank 
Intensity ——_— otal Cum. 
rank (Neg.) (Pos.) freq. 0 
O-1 8 6 6 7-8 
15 (High) 1- -—- —- —- —- — 1 100 
14 1 —- -—- —- —- —- — 1 100 
13 5 1- - —- —- — 6 100 
12 6 2 1- —- — 2 11 99 
11 6 6 4 — 1 1 — 17 97 
10 3 5 6B 2 2 2 5 24 94 
9 _ 6 il 7 2 2 4 82 90 
8 1 7 10 8 8 8 4 46 85 
7 2 1 18 9 14 7 3 49 717 
6 2 3 ee 13 29 7 6 82 69 
5 _ 1 25 #2 48 8 1 106 55 
4 _ 3 9 8 ¢@ Hh — 91 38 
3 — 1 9 7 39 5 — 61 23 
2 1 2 5 12 23 4 0— 47 13 
1 — 2 1 6 10 1 — 20 5 
0 (Low) 1 1 — 2 6 —- — 10 2 
Total frequency 28 41 #115 96 242 56 25 603 
Cumulative per cent 5 11 31 46 87 96 100 
Midpoint of content 
percentiles 2 8 21 38 67 91 98 
Median of intensity 
percentiles 96 84 61 48 34 583 82 


An example of how “generalized” intensity can be related to 
responses to a particular content question is the following—the 
sample of soldiers was divided into three groups according to their 
generalized intensity: low, medium, and high. When asked the 
question: 


All things considered, do you think the Army is run about as efficiently 
as possible, or do you think it could be run better? 
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the responses for the three groups turned out to be: 


GENERAL INTENSITY 


Low Medium High 

It is run as well as possible 38% 32% 28% 
It could be run somewhat better 42 36 27 
It could be run a lot better 20 32 45 

Total per cent 100% 100% 100% 
Total cases 441 800 526 


The categories obviously mean different things to the different kinds 
of people. 
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Figure 19. Attitude toward the WAC (high generalised intensity). 
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Another example would be the question: 


On the whole, would you say you are well adjusted or poorly adjusted to 
Army life? 


Low Medium High 

Very well adjusted 8% 17% 29% 
Fairly well adjusted 56 47 34 
Not so well adjusted 23 18 14 
Very poorly adjusted 13 18 23 

Total per cent 100% 100% 100% 
Total cases 441 800 5265 


The percentage of respondents answering “very well adjusted” 
varied from 8 per cent to 29 per cent depending upon the generalized 
verbal intensity habits of the respondent. 
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Figure 20. Attitude toward the WAC (medium generalised intensity). 
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The distribution of content scale scores was affected as well as the 
single question marginals. For example, the scale of attitude 
toward the Army showed the following relationship to generalized 
intensity: 


GENERAL INTENSITY 


Scale score Low Medium High 
0 to 6 18% 35% 45% 
7to9 44 32 23 

10 to 16 38 33 32 

Total per cent 100% 100% 100% 
Total cases 441 800 525 


A similar relationship between generalized verbal intensity and 
single question marginal frequencies or scale frequencies was ob- 
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Figure 21. Attitude toward the WAC (low generalised intensity). 
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served in almost all.cases. Responses to questions, therefore, in- 
volve not only the attitudes and opinions of the individual but also 
his verbal habits of expression. 


Empirical Demonstration of the Invariance of the Zero Point 
How can the claim be tested that the proposed method produces 
results independent of any “‘biases’’ in the specific questions asked? 
Perhaps the best way to destroy this claim would be to take two 
series of opinion questions on the same topic, but ‘“‘biased” in com- 
pletely opposite directions and producing apparently widely dif- 
TABLE 22 


ComPARISON OF DISTRIBUTION OF REepuigs TO “UNFAVORABLY BIASED” 
AND “FavoraBLy BiasepD” SERIES OF QUESTIONS 


OPINION OF ARMY OPINION OF OFFICERS 

SERIES Question Per cent Question Per cent 

number favorable number favorable 

Series ‘‘A”’ 1 8% 1 1% 

(“Unfavorably 2 14 2 15 
biased’’) 3 16 3 23 
4 31 4 29 
5 33 5 32 
6 43 6 47 
Series “B” 7 58 7 50 
(“Favorably 8 65 8 52 
biased”’) 9 70 9 68 
10 79 10 75 
11 81 11 83 
12 90 12 93 


ferent results, and then to see if the inclusion of the intensity meas- 
urement serves to ‘‘correct” these ‘‘biases’’ and produce the same 
final result for both sets of questions. Stated in other words, let us 
suppose that two different polling agencies wanted to find out how 
many people hold “favorable” or “unfavorable” opinions concern- 
ing a certain issue. One agency asks several questions which indi- 
cate that less than half of the people are “favorable” on each of the 
questions asked. The other agency asks several different questions 
which indicate, however, that more than half of the people are 
“favorable” on each of the questions that it asked. Both sets of 
questions deal with the same issue, but apparently produce com- 
pletely opposite results. Here is the problem of question ‘‘bias”’ 
stated in its most extreme terms. What kind of solution does the 
present method of intensity analysis offer? 
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Last 1 
Attitude toward the Army 


(The manner in which the answer categories were dichotomized can be seen from the 
weights of 0 or 1 appearing before each category.) 


Series A 
(“Unfavorably Biased’ Dichotomization) 


1. On the whole, do you think the Army gives a man a chance to show what he can do? 
(1) A very good chance 
(0) A fairly good chance 
(0) Not much of a chance 
(0) No chance at all 
(0) Undecided 


2. In general, how well do you think the Army is run? 
(1) It is run very well 
(0) It is run pretty well 
(0) It is not run so well 
(0) It is run very poorly 
(0) Undecided 


3. Do you think when you are discharged you will go back to civilian life witha 
favorable or unfavorable attitude toward the Army? 
(1) Very favorable 
(0) Fairly favorable 
(0) About 50-50 
(0) Fairly unfavorable 
(0) Very unfavorable 


4. In general, do you think the Army has tried its best to see that men get as square & 
deal as possible? 
(1) Yes, it has tried ita best 
(0) It has tried some but not hard enough 
(0) It has hardly tried at all 


5. Do too many of the things you have to do in the Army seem unnecessary? 
(1) No, not too many of them seem unnecessary 
(0) Yes, too many of them seem unnecessary 


6. In the Army, some jobs are naturally harder and more dangerous than others and 
the Army has to put men where it thinks they are needed. 
Considering everything, do you think the Army is trying its best to see that, as far 
as possible, no man gets more than his fair share of the hard and dangerous jobs? 
(1) Yes, it is trying its best 
(0) It is trying some, but not hard enough 
(0) It is hardly trying at all 


Series B 
(“Favorably Biased’? Dichotomization) 


7. In general, how interested do you think the Army is in your welfare? 
(1) Very much 
(1) Pretty much 
(0) Not so much 
(0) Not at all 


8. What do you think of the statement that ‘The Army makes a man out of you?” 
(1) There’s a lot to it 
(1) There may be something to it, but I’m still doubtful 
(0) There is not much to it 
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9. All things considered, do you think the Army is run about as efficiently as possible, 
or do you think it could be run better? 
(1) It is run about as well as possible, everything considered 
(1) It could be run somewhat better 
(0) It could be run a lot better 


10. Do you think the Army is trying its best to see that the men who have the hard and 
dangerous jobs get the special consideration and breaks they deserve? 
(1) Yes, it is trying its beat 
(1) It is trying some, but not hard enough 
(0) It is hardly trying at all 


11. In general, do you feel you yourself have gotten a square deal from the Army? 
(1) Yes, in most ways I have 
(1) In some ways, yes, in other ways, no 
(0) No, on the whole I haven’t gotten a square deal 


12. Do you feel that the Army is trying its best to look out for the welfare of enlisted 
men? 
(1) Yes, it is trying its best 
(1) It is trying some, but not hard enough 
(0) It is hardly trying at all 


List 2 
Altitude toward Officers 


(The manner in which the answer categories were dichotomized can be seen from the 
weights of 0 or 1 appearing before each category.) 


Series A 
(“Unfavorably Biased” Dichotomization) 


1. How much did you personally like your officers? 
(1) Very much 
(0) Pretty much 
(0) Not so much 
(0) Not at all 


2. How do you feel about the privileges that officers get compared with those which 
i men get? 
(0) Officers have far too many privileges 
(0) Officers have a few too many privileges 
(1) Officers have about the right number of privileges 
(1) Officers have too few privileges 


3. How much did you personally respect your officers? 
(1) Very much 
(0) Pretty much 
(0) Not so much 
(0) Not at all 


4, When you did a particularly good job did you usually get recognition or praise for 
it, from your officers? 
(1) Always 


10. 


11. 


12. 
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. How did you feel about the officers that had been selected by the Army? 


(1) They were the best ones that could have been picked 
(1) They were as good as any that could have been picked 
(1) Undecided 

(0) Somewhat better ones could have been picked 

(0) Much better ones could have been picked 


. Do you think that your officers generally did what they could to help you? 


(1) Yes, all the time 
(1) Yes, most of the time 
(0) No, they often did not 
(0) No, they almost never did 
Series B 
(“‘Favorably Biased’? Dichotomization) 


. How well do you feel that your officers understood your problems and needs? 


(1) They were very much aware of my problems and needs 
(1) They were fairly well aware of my problems and needs 
(0) They did not know very much about my real problems and needs 


. Do you feel that your officers recognized your abilities and what you were able to 
do? 


(1) Yes, I’m sure they did 

(1) Yes, I think they did, but I’m not sure 
(1) Undecided 

(0) No, I don’t think they did 


. How many of your officers used their rank in ways that seemed unnecessary to you? 


(0) Almost all of them 
(0) Moet of them 
(1) Some of them 
(1) Only a few of them 
(1) None of them 


In general, how good would you say your officers were? 
(1) Very good 
(1) Fairly good 
(1) About average 
(0) Pretty poor 
(0) Very poor 


Did your officers give you a chance to ask questions as to the reason why things 
were done the way they were? 
. (1) Yes, always 

(1) Yes, usually 

(1) Undecided 

(0) No, not very often 

(0) No, almost never 


How many of your officers took a personal interest in their men? 
(1) All of them 
(1) Most of them 
(1) About half of them 
(0) Few of them 
(0) None of them 


The answer is clearly demonstrated by two examples of exactly 


the above nature. The first concerns the universe of attitudes of 
soldiers toward the Army, while the second concerns soldiers’ opin- 
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ions of their officers. In each example a cross section of enlisted 
men in the United States was asked two sets of six questions each. 
These two sets of questions produced widely opposing percentages 
of the population with ‘‘more favorable” and “less favorable” 
opinions on each question separately. The extent of this diver- 
gence can be seen from Table 22. Series A was composed of ques- 
tions which resulted in less than half of the respondents expressing 
“favorable” attitudes on all six questions, while Series B was com- 
posed of questions which resulted in more than half of the respond- 
ents expressing “favorable” attitudes on all six questions. Each 
series of six questions was tested for scalability to assure that, from 
the respondent’s point of view, they all concerned the same opinion 
issue." The exact questions asked are given in Lists 1 and 2. The 
manner in which the answer categories were dichotomized can be 
seen from the scale weights of 0 or 1 given to each category.“ Each 
opinion question was followed by an intensity question asking, 
“How strongly do you feel about this?” 

From Table 22 it can be seen that, depending upon which series 
of six questions was used, soldiers’ opinions of the Army and of their 
officers could be characterized as “favorable” or “unfavorable.” 
One polling agency using Series A would conclude that the men 
“disliked”’ the Army and their officers, while another agency using 
Series B would conclude that the men “liked’”’ the Army and their 
officers. However, our claim is that, were both agencies to ask in- 
tensity of feeling also and to correlate opinion scores by intensity 
scores, they would draw substantially the same conclusions, regard- 
less of which set of questions was used. The U- or J-shaped curve 
obtained by plotting opinion scores against intensity scores would 
divide the population into the same percentages with ‘favorable’ 
and “unfavorable” opinions, regardless of question “‘bias.”” What 
do our data show in this respect? 

Tables 23a and 23b and 24a and 24b show the correlation between 
intensity scores and content scores for Series A and Series B. In or- 
der to plot these scores, the median intensity value is computed for 
each content value. These medians are italicized in the tables and 


* The two examples, opinion of the Army and opinion of officers, were obtained from 
two separate surveys. In each case, the two sets of six questions were mixed at random 
so that the respondent answered twelve questions in each area without knowing which 
six went together. 

1 Scalograms for each of these two areas are given in preceding chapters. 

13 Since the answer categories represent only qualitative gradations from more to less 
“favorable,” these weights can be given fairly arbitrarily to produce relatively high or 
low percentages with “favorable” opinions. 
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TABLE 23a 


ATTITUDE TOWARD THE ARMY 
Series A (‘““Unfavorably Biased’) 


Content rank 
Intensity —_ Total Cum. 
rank (Neg.) (Pos.) freq. % 
0 1 2 8 4 6 6 
6 (High) 222 112 49 #45 26 18 $0 502 100 
5 112 «61 29 #30 18 = 8g! 5 276 73 
4 91 76 «638 ~—=«(O31 9 5 5 255 58 
3 76 #61 60 #8 ii 8 2 236 44 
2 68 48 47 17° 16 1 1 188 31 
1 50 50 33 = 29 5 4 — 171 21 
0 (Low) 59 61 45 44 7 3 2 221 12 
Total frequency 668 469 291 224 92 60 45 1,849 
Cumulative per cent 36 61 77 89 94 98 100 
Midpoint of content 
percentiles 18 49 69 83 92 96 99 
Median of intensity 
percentiles 58 47 37 41 55 64 80 
TABLE 23b 
ATTITUDE TOWARD THE ARMY 
Series B (‘Favorably Biased’’) 
Content rank 
Intensity Total Cum 
rank (Neg.) (Pos.) freq. % 
o 1 &# S$ & 6 
6 (High) 85 45 63 70 96 90 164 563 100 
5 1 18 2 61 42 68 105 320 70 
4 — 11 21 29 4 6&8 64 222 52 
3 — 3 9 31 35 %5 72 206 40 
2 1 — ll 18 34 6&8 68 190 29 
1 2 § 12 24 49 75 167 19 
0 (Low) — 8 3 7 24 41 98 181 10 
Total frequency 37 87 137 228 299 415 646 1,849 
Cumulative per cent 2 7 14 26 43 65 100 
Midpoint of content 
percentiles 1 4 10 20 35 54 8&8 
Median of intensity 


percentiles & 71 #66 #57 #49 41 42 


272 


TABLE 248 


ATTITUDE TOWARD OvFiceRs 
Series A (“Unfavorably Biased’’) 
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Content rank 
Intensity Total Cum. 
rank (Neg.) (Pos.) freq. % 
0 1 2 8 4 6 6 
6 (High) 266 108 56 34 14 #15 12 502 100 
5 161 102 58 34 20 15 9 394 83 
4 188 113 66 43 20 20 10 410 69 
3 128 116 78 87 24 13 4 395 55 
2 9 96 78 6&8 86 6 1 359 41 
1 81 112 1038 68 27 12 2 405 29 
0 (Low) 6 108 109 96 48 15 1 437 15 
Total frequency 920 752 537 370 188 96 39 2,902 
Cumulative per cent 82 58 76 88 95 99 100 
Midpoint of content 
percentiles 16 45 67 88 92 97 = 99 
Median of intensity 
percentiles 66 48 89 88 36 5&6 71 
TABLE 24b 
ATTITUDE TOWARD OFFICERS 
Series B (‘‘Favorably Biased’) 
Content rank 
Intensity Total Cum. 
rank (Neg.) (Pos.) freq. % 
0 1 2 8 4 6 6 
6 (High) 26 67 &4& 91 9 8 68 510 100 
5 5&6 22 60 8% 75 69 & 355 82 
4 4 14 4 76 98 89 6 383 70 
3 = 6 24 67 116 110 74 306 57 
2 > 8 16 68 108 145 97 422 43 
1 _ & 12 31 87 129 8 344 29 
0 (Low) 8 4 7 2 86 169 190 492 17 
Total frequency 43 126 236 420 659 791 627 2,902 
Cumulative per cent 1 6 144 2 61 78 100 
Midpoint of content 
percentiles 1 4 10 21 40 65 89 
Median of intensity 
percentiles 8 838 74 63 50 39 35 
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are plotted in Figures 23 and 24. These charts show quite clearly 
the effect of the “biased” selection of questions and demonstrate 
how this “bias” can be taken care of by means of the present method. 

The agency using the “unfavorably biased’’ Series A on attitude 
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Figure 23*. Attitude toward the Army. 
Series A—positive end of universe (Table 23a—“unfavorably biased’’). 
Series B—negative end of universe (Table 23b—“favorably biased’’). 


* There is no Figure 22. To avoid confusion, numbering of scalogram tables and figures is kept parallel 
throughout this chapter. 


toward the Army would find from its intensity curve that the low 
point of the curve fell at 69 per cent, indicating approximately only 
31 per cent “favorable” and 69 per cent “unfavorable” in their 
attitude toward the Army. The agency using the “favorably 
biased”’ Series B would have to conclude that, despite the fact that 
none of its six questions showed less than 58 per cent “favorable,” 
the low point of the curve fell between 54 per cent and 83 per cent 
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(a crude single point estimate can be taken as the midpoint of this 
interval, or 68 per cent, indicating only 32 per cent ‘‘favorable’’); 
this agency would see that, to determine this point more closely, 
questions showing greater ‘“unfavorableness” toward the Army 
would have to be included in the series asked. Thus, both agencies 
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Figure 24. Attitude toward officers. 
Series A—positive end of universe (Table 24a—“unfavorably biased’). 
Series B—negative end of universe (Table 24b— “favorably biased’). 


(Neg) 


would come to the conclusion that about 30 per cent of the men were 
favorable, and about 70 per cent were unfavorable toward the Army. 

The same results are obtained for Series A and B on attitude to- 
ward officers. The intensity curve of Series A, with ‘“‘unfavorably 
biased’’ dichotomizations, shows quite clearly that approximately 
83 per cent have “unfavorable” opinions of their officers. Despite 
the fact that Series B shows no more than 50 per cent ‘“‘unfavorable” 
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on any of the six questions, its intensity curve would produce an 
estimate of between 65 and 89 per cent “unfavorable” (midpoint 77 
per cent), with a clear-cut indication that additional questions show- 
ing greater ‘“‘unfavorableness” toward officers are needed for a closer 
determination of the dividing point. 

Of course, the best answer would be obtained by combining 
Series A and Series B and asking the respondent all questions (or 
selecting alternate questions from each series). The clear-cut 
results obtained by asking all questions have already been shown for 
these two areas in the previous section. These previous figures 
show a careful sampling of all degrees of “favorable” or ‘“‘unfavor- 
able” opinions, and clearly indicate just how the entire population 
should be divided, so as to obtain an objective, invariant distribu- 
tion into “pro” and “con” which is free from the problem of ques- 
tion “bias.” 


Summary 


1. One purpose of intensity analysis is to provide an objective 
method of determining an invariant cutting point for an attitude or 
opinion scale. This cutting point will enable the research worker 
to divide his population into favorable and unfavorable groups, a 
division which will be independent of the selection or “‘bias’” of the 
specific questions asked. 

2. The basis of intensity analysis is the theory of scaling which 
provides a test for the single meaning of a series of questions. Any 
single question or series of questions on an issue is considered as 
simply a sample of all other questions on the same issue which might 
have been asked instead. The problem of scaling is to test whether 
the particular sample of questions used can be considered as belong- 
ing to an attitude or opinion universe that concerns only one topic. 
Once a selected series of questions has been determined to be scal- 
able, it is possible to rank the respondents from high to low on the 
single attitude universe being studied. 

3. In addition to the rank order of individuals on the content or 
attitude scale, it is also possible to determine the intensity with 
which an attitude is held. This intensity measurement is found to 
be a U- or J-shaped function of the content scale. People on both 
ends of the content scale feel more strongly than people in the middle 
of the scale. As one moves down the content scale, intensity of 
feeling decreases until a point is reached where intensity of feeling 
begins to increase again. This point is invariant for any single 
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attitude area, and regardless of the sample of attitude questions 
used will always divide the population into the same proportion 
with positive and negative opinions. Thus, this cutting point is 
both invariant and objective. 

4. Not only is the zero point invariant with respect to item selec- 
tion, but so also is the entire curve. The shape of the curve around 
the zero point shows whether there is a broad region of relative in- 
difference or neutrality, or whether the population is sharply di- 
vided on the issue. 

5. One method used in this chapter for measuring intensity of 
feeling was to ask, “(How strongly do you feel about this?” after 
each attitude question. Correlating the content scale scores with 
the intensity scores produced the hypothesized U- or J-shaped 
curve. This technique of measuring intensity is far from perfect 
and results in a large amount of variance around the curve. Fur- 
ther research will probably serve to reduce much of this variance. 
However, crude as it is, the present technique does work and has 
been used successfully in several instances. The fold-over tech- 
nique was also found to succeed occasionally; this technique has the 
advantage of not lengthening the questionnaire. 


CHAPTER 8 


PROBLEMS OF RELIABILITY" 


hree kinds of reliability. There are at least three important 

sources for variation in empirical results that are to be con- 
sidered in an attitude or opinion survey. One is the sampling of 
people; the second is the sampling of items or questions; and the 
the third is the sampling of trials. 

The problem of sampling of people is the usual one which arises 
when the whole population is not observed, but only a sample there- 
from is taken. How much do the sample results vary from what 
would have been obtained if the whole population of people had 
been studied? For scalogram analysis, we want to know if the 
hypothesis that the data are scalable for the entire population can 
be tested by means of only a sample of the people. In practice, 
pretests on about a hundred people have been used to decide whether 
or not an area of content was scalable. To what extent is this 
justified? 

The second problem of sampling is with reference to the universe 
of items. Any questions used in an attitude or opinion survey are 
but a sample of all possible similar questions that could have been 
used. The hypothesis that scalogram analysis intends to test is 
that the entire universe of items is scalable. In practice, a sample 
of ten or twelve items in a pretest has been used as a basis for the 
test. Can such a sample properly test the hypothesis? 

The third problem of sampling concerns the stability of responses 
of individuals to the items. If the survey were to be repeated over 
and over again on the same individuals and on the same items under 
similar conditions—without carryover from trial to trial—how much 
variation would there be in the responses of the individuals? This 
is the problem of test-retest reliability. 

Let us discuss each of these three kinds of reliability problems in 
turn, from the point of view of scalogram analysis. 

With respect to the sampling of people, we shall consider the relia- 

1 By Louis Guttman. 
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bility of the coefficient of reproducibility and of the frequencies of 
the various scale types. The coefficient, we shall find, can have 
high reliability in the small sample of people used for the pretest, 
while the frequency distribution of types (for the given sample 
items) requires the larger sample of the final study for stability. 

With respect to the sampling of items, we shall consider the relia- 
bility of the coefficient of reproducibility and show the need for the 
additional criterion of “improvement.’”’ We shall also study the 
reliability of the scale scores. The hypothesis of approximately 
perfect reproducibility will be found to be testable on the basis of a 
dozen or so items in a pretest. If the hypothesis is accepted, then 
fewer items can be used in the final study. 

As for test-retest reliability, or the sampling of trials, we shall 
find that approximately perfect scales necessarily have high relia- 
bility, both with respect to the individual’s response to each item 
and to his total score. 


The Sampling of People 


The hypothesis of approximate perfection. In order to develop a 
sampling theory for any problem, certain characteristics of the popu- 
lation distribution must be specified and the sampling scheme must 
be defined. The hypothesis to be tested by scalogram analysis is 
that the population has an approximately scalable distribution of 
responses for the given items, that is, that each item is highly repro- 
ducible from scale scores. Such a hypothesis of approximate per- 
fection differs somewhat from customary hypotheses found in 
sampling theory elsewhere. The usual kind of hypothesis is that 
complete imperfection exists: that there is no difference between 
groups or that variables are not correlated, etc., etc. 

It is quite easy to refute a hypothesis about perfection by means 
of a sample drawn by alinost any scheme, random or not. For ex- 
ample, it is easy to prove that height and weight are not perfectly 
correlated for a given population by taking three people and find- 
ing that there is not a perfect linear relation between their heights 
and weights. Three people are sufficient here because if there were 
a perfect relationship in the population, this would have to hold in 
any sample whatsoever. If all points lie on a straight line, then 
any sample of those points must lie on the same straight line. 
There is no room for sampling variation. 

Similarly, if a set of items were perfectly scalable for a popula- 
tion, it would have to be perfectly scalable for any sample of people 


PROBLEMS OF RELIABILITY 279 


whatsoever drawn from that population. If an item is a simple 
function of the scale scores for the population, it must remain the 
same simple function of the scale scores in any sample drawn from 
that population. This is a form of perfect correlation, and there is 
no room for sampling variability. If, in the population, all people 
with the same scale score have the same response to an item, then 
any sample of people with that scale score will have only the given 
item response. 

It is no trick, then, to destroy the hypothesis that a set of items is 
perfectly scalable. Sampling error with respect to people can occur 
only when there is imperfection or scale error in the population. In 
practice, we do not find perfect scales in samples, which therefore 
means that our data are not perfectly scalable for the populations 
from which the samples are drawn. This leads to difficulties in 
formulating an analytical statement of the population distribution 
of responses, because there are many varieties of possible popula- 
tion patterns of error. The coefficient of reproducibility is but one 
aspect of error, and even its sampling distribution thus far has not 
yet yielded to a complete mathematical analysis. The sampling 
theory of scale error is still in its infancy as far as fully rigorous 
treatment goes. 

The reliability of the coefficient of reproducibility. There is, how- 
ever, cogent reason for believing that the higher the coefficient of 
reproducibility for the population, the less likely is the reproduci- 
bility of any random sample of people to depart from it by any given 
amount. There is reason to suspect that the sampling behavior of 
the coefficient should not be radically different from the sampling 
behavior of ordinary proportions (which follow the binomial distri- 
bution) especially when the population reproducibility is high (say, 
over .90). As is well known for the random sampling of ordinary 
proportions, if P is the population proportion and if N is the size of 
each random sample, then the standard error of sample proportions 


is (faa. This standard error decreases as P approaches 
N 


unity, and in fact is zeroif P = 1. 

It is highly plausible that a similar situation should hold for the 
coefficient of reproducibility. We do know that the standard error 
of sample reproducibility coefficients is zero if the population repro- 
ducibility is unity, for there is no room for sampling variation in the 
perfect scale. If there is but little error in the population, then 
there is little chance for error in a sample, and the more error there 


280 PROBLEMS OF RELIABILITY 


is in the population the more chance for error there is in a sample. 
Hence, the variance of sample coefficients of reproducibility should 
increase as the reproducibility of the population decreases. It 
seems highly plausible, then, that the sampling theory of ordinary 
proportions can be used as an approximation to the sampling theory 
of the coefficient of reproducibility. If so, a random sample of a 
hundred people, with responses to ten items, provides a base of a 
thousand observations on which to compute the coefficient of repro- 
ducibility. If the errors of reproducibility are independent of each 
other, and if all the items have equal reproducibility for the popu- 
lation, then the sample coefficient of reproducibility would have 
exactly the same distribution as an ordinary random sample propor- 
tion based on a thousand cases. If the items have unequal repro- 
ducibility for the population (and errors are independent), then the 
coefficient would have the distribution of an ordinary proportion of 
a stratified random sample, which would have a smaller variance 
than an unstratified random sample with the same average popula- 
tion proportion. The crux of the matter lies in whether or not the 
errors are independent. An adequate test of this hypothesis of 
independence has not yet been devised. 

In any event, requiring reproducibility of at least .90 tends to 
insure that, by and large, only populations with reproducibility in 
that neighborhood will be inferred to be approximately scalable, 
since it seems clear that the sampling variance of the coefficient of 
reproducibility cannot be exorbitant in this range. 

Empirical studies of variation in reproducibility. Empirical stud- 
ies of the sampling variability of approximate scale patterns have 
been made, and they indicate very high stability both for the 
amount and the general pattern of error. Only one example need 
be presented in detail here. Seven questions on attitude toward 
officers were asked on one form (Scalogram 15: Form A) of a ques- 
tionnaire to a cross section of enlisted men, and a different set of 
seven questions on another form (Scalogram 15: Form B). The 
questions are as follows: 


Attitude toward officers 
Form A 


25. Do your officers give you a good chance to ask questions as to the 
reason why things are done the way they are? 


1 Yes, always 
2 Yes, usually 
3 Undecided 


31. 


32 


35. 


37. 


38. 


27. 
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4 No, not very often 
5 No, almost never 
In general, how good would you say your officers are? 
1 Very good 
2 Fairly good 
3 ____ About average 
4 Pretty poor 
5 Very poor 
How many of your officers use their rank in ways that seem unneces- 
sary to you? 
1 Almost all of them do 
2 Most of them do 
3 —_—___ Some of them do 
4 Only a few of them do 
5 None of them do 


. How many of the officers in your company (battery, squadron, troop) 


are the kind you would want to serve under in combat? 
1 ____ All of them are 


2 —___ Most of them are 
3 ____. About half of them are 
4 Few of them are 
5 None of them are 
How much do you personally like your officers? 
Very much 
2_____ Pretty much 
3 Not so much 
4 Not at all 


How many of your company officers are the kind who are willing 
to go through anything they ask their men to go through? 


1 All of them are 
2_—____ Most of them are 

3 _____.. About half of them are 
4 Few of them are 

5 None of them are 


How do you feel about the officers that have been picked for your 
outfit? 


1 They are the best ones that could have been picked 


2 They are as good as any that could have been picked 
3 _____ Somewhat better ones could have been picked 
4______. Much better ones could have been picked 

5 Undecided 


Form B 


How many of your officers take a personal interest in their men? 
1 _____ Alll of them do 

Most of them do 

____— About half of them do 

Few of them do 

None of them do 


2 
3 
4 
5 
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28. Do you think that your officers generally do what they can to help 


you? 
1 _____ Yes, all the time 
2 Yes, most of the time 
3 No, they often do not 
4 No, they almost never do 
29. How well do you feel that your officers understand your problems 
and needs? 
1 They are very much aware of my problems and needs 
2 They are fairly well aware of my problems and needs 
3 —____ They do not know very much about my real problems and 
needs 
30. Do you feel that your officers recognize your abilities and what you 
are able to do? 
1 Yes, I’m sure they do 
2 Yes, I think they do, but I’m not sure 
3 No, I don’t think they do 
4 Undecided 


33. When you do a particularly good job do you usually get recognition 
or praise for it from your officers? 


Always 
2 Usually 
3 ______ Rarely 
4 Never 


36. When your officers give you something to do, do they tell you enough 
about it so that you can do a good job? 


1 _____ Always tell me enough 
2 Usually tell me enough 
3 _____ Often do not tell me enough 
4______. Almost never tell me enough 
5 Undecided 
39. Can you count on your officers to back you up in your duties? 
1 Yes, always 
2 Yes, usually 
3 No, you can’t count on it 


A random sample of a hundred men was drawn for the first form 
and a scalogram analysis made. The resulting scalogram for Form 
A is shown in Scalogram 15, A. The coefficient of reproducibility 
is .89. 

The scalogram for Form B, based on the responses of the same 
men used for Form A, shows a closely similar pattern of error with 
the coefficient of reproducibility of .92, as shown in Scalogram 15, B. 

Similar results have been obtained in many other experiments. 
Perhaps one more example might be mentioned briefly. In a study 
of desire to return to school after the war, the pattern of error for a 
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sample of three thousand enlisted men was studied and found to 
agree closely with the pattern of error previously obtained in a 
sample of a hundred men drawn at random from the three thousand. 

Reliability of marginal frequencies and other aspects. It should not 
be inferred from the fact that a hundred cases may be adequate to 
test the hypothesis of scalability that they are therefore adequate 
for use in the final study to portray the distribution of the attitude 
or opinion. That is quite a different matter. The proportion of 
people at each scale type or score is subject essentially to the laws 
of ordinary random sampling. If the proportion in the population 
of a given scale type is not close to zero or unity, then the sample 
proportion can have considerable sampling variation. Such pro- 
portions have a base of only a hundred observations, which is a small 
sample on which to base an ordinary porportion. For example, 
compare the proportions of each scale type for the two examples of 
men on attitude toward officers just shown above. The frequency 
distributions of the scale types differ according as to what might be 
expected from only a hundred cases. In order to obtain a stable 
distribution of scale types—which is important for external predic- 
tion purposes and for internal purposes like obtaining a zero point 
by intensity analysis—the usual three thousand or so cases are of the 
magnitude desired. 

A small sample of about one hundred cases may be used in a pre- 
test to test the hypothesis of scalability, and the hypothesis may be 
rejected or accepted on the basis of the sample. If the hypothesis is 
accepted, then the final study can be conducted on a larger sample to 
obtain reliable frequencies for the scale types. Fewer items can be 
used in the final study than in the pretest, as is discussed next in 
connection with the problem of sampling of items. 

If items with four or five categories are used initially, and if scalo- 
gram analysis shows that much combination of categories is required 
to obtain high reproducibility, then it is desirable to use more than a 
hundred cases in the pretest. The initial reproducibility is so far 
from perfect that it may allow for substantial sampling error in the 
combinations. Similarly, quasi scales can be expected to have a 
greater sampling variance in their error pattern because of their 
imperfect reproducibility, and hence require larger samples than a 
hundred in a pretest. The future undoubtedly will bring sharper 
mathematical and empirical evidence to bear on the sample size 
to use for these various aspects of scalogram analysis. 


: Form A (Coefficient of Reproduci- 
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Respondent order 


Scarogram No. 15. ATTITUDE TOWARD OFFICERS 
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The Sampling of Items 


The hypothesis of approximate perfection. In any attitude or opin- 
ion study, only a sample of items is used from the entire universe of 
content under consideration. It is difficult to think of items as 
being drawn by anything like a random process from the universe. 
Questions are constructed by the research worker. He selects a par- 
ticular wording of the question, a particular aspect of the content 
to emphasize, etc., etc. It is not as if there were available a list of 
all possible questions and their variations from which those used in 
the study were drawn at random. It seems that random sampling 
theory is not adequate for the problem of inference for the case of 
sampling from a universe of content. 

One of the fortunate aspects of scale analysis is that the hypothesis 
it wants to test differs in its nature from the ordinary hypotheses of 
statistical inference in other fields. The hypothesis is that an ap- 
proximately perfect pattern exists, whereas most of current statisti- 
cal inference, as mentioned in the previous section, is concerned 
with testing hypotheses of lack of relationships. Just as the hypoth- 
esis of a perfect linear correlation can be destroyed by a sample of 
three people, whether drawn at random or not, just so can the 
hypothesis of a perfect scale be destroyed by a sample of three—or 
even two—items, no matter how they are drawn. Therefore, to 
test the hypothesis of a perfect scale we do not have to worry about 
the fact that items are not necessarily selected at random from the 
universe of content. No matter how they are selected, they can 
serve to refute the hypothesis that the universe of content is per- 
fectly scalable for the population. 

In practice, the hypothesis of perfect scalability has always been 
refuted by the samples obtained. For example, we never find in a 
fourfold table between two items that there is a perfectly zero cell 
as required by the perfect scalogram pattern. (See Chapter 3.) 
This means that neither item can be perfectly reproducible from 
scale scores derived from the universe of items, for they are not even 
perfectly reproducible from scores derived only from themselves. 

In practice, we are concerned not with a hypothesis of perfect 
reproducibility, but rather a hypothesis of approximately perfect 
reproducibility. Can we infer, if the reproducibility of a sample of 
items is .95, that therefore the reproducibility of the entire universe 
of items is in a relatively small neighborhood of .95? If a million 
items have a reproducibility of .90, how much variation would there 
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be in the reproducibility of all samples of ten items each that could 
be drawn from this larger set? These questions do not seem sus- 
ceptible to solution by ordinary random sampling theory because, 
as was indicated just previously, items are not selected by any ran- 
dom process. Furthermore, we have no way yet of specifying the 
population distribution of items. How many items are there in the 
universe which will split the population into two groups of 50-50? 
How many will split the population into a 60-40 division? How 
many into a 70-30 division? Etc., etc. 

Reproducibility and improvement as criteria. In order to avoid 
having to use a notion of random sampling, it seems that we must 
use a criterion which will involve again some notion of perfection so 
that there will not be much room for variation regardless of how the 
sample is drawn. Then it will be relatively immaterial whether the 
sample is random or not. 

In practice we have used two criteria. One is that each item sep- 
arately should have high reproducibility. This is a kind of perfec- 
tion. If all items in the universe have reproducibility of at least 
.85, then every item drawn from the universe, whether random or 
not, will have reproducibility of at least .85. Hence, if in a sample 
of ten items, each item separately has reproducibility of at least 
.85, it would seem plausible to infer that the universe reproducibility 
was at least that high. Therefore, a criterion used is not only that 
the total sample reproducibility shall be around .90, but that each 
separate item should have reproducibility not much below .90. 

A second criterion is to take care of the possibility that sample 
reproducibility is spuriously high. It is easy to see that for any 
item whatsoever, whether it belongs to a scalable universe or not, 
the reproducibility cannot be less than its highest category fre- 
quency. For example, in a dichotomous item where 90 per cent of 
the population is in one category and 10 per cent is in the other, 
there cannot be more than 10 per cent error for reproducing that 
item from any set of scores whatsoever, regardless of the relation- 
ship of that item to the other items or to the scores. By simply 
predicting every person to be in the category with the largest fre- 
quency, the proportion of correct estimates will be the relative fre- 


* The population distribution of items determines the shape of the intensity and other 
components of a scale, according to the equations in the next chapter. Hence, by ob- 
taining an empirical intensity function as in Chapter 7, we obtain indirect information 
about the universe of items. It seems plausible that studying the further components 
empirically may be a proper approach to determining the universe distribution of items. 
This would avoid the problem of obtaining random samples of items. 
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quency of that modal category. Hence, it is important to guard 
against spuriously high sample reproducibility for items which have 
modal categories which contain a vast majority of the population. 
To do this, we use the criterion of improvement. Not only must 
reproducibility of each item be high from the trial scale score, but 
the scale error must be at most half of that which would be obtained 
without knowledge of the scale pattern, that is, from the modal fre- 
quencies alone. 

An example of rejecting the scale hypothesis. Let us illustrate how 
high reproducibility can be obtained from a sample of items from a 
nonscalable universe, and yet the hypothesis of scalability will be 
rejected by means of the criterion of improvement. In order to 
keep the data simple, we shall consider a case of four dichotomous 
items which are statistically independent of each other. If these 
are ‘Yes’”-“‘No” questions, by statistical independence is meant that 
the proportion of people who say ‘“‘Yes”’ simultaneously to two ques- 
tions is equal to the product of the proportion of those who say 
‘*Yes” to the first question and the proportion who say “Yes” to the 
second question; the proportion who say “Yes” to the first question 
but “No” to the second question is equal to the product of the pro- 
portion of those who say “‘Yes”’ to the first and the proportion who 
say ‘‘No” to the second question; and similarly for the other two 
combinations of responses. Statistical independence is the condi- 
tion of no relationship whatsoever between the items, an opposite 
of scalability, which means that each item is a perfect function of 
the same variable. 

If the four items are statistically independent of each other, then 
we can calculate not only the joint occurrences two at a time of the 
items from their separate frequencies, but also the higher order joint 
occurrences, including four at a time. For example, the proportion 
of the population who say ‘“‘Yes” to all four questions is equal to the 
proportion who say ‘‘Yes’’ to the first question times the propor- 
tion who say ‘‘Yes” to the second question times the proportion 
who say ‘‘Yes” to the third question times the proportion who say 
“Yes” to the fourth question. There are sixteen possible combi- 
nations of responses from four dichotomous questions, and the pro- 
portion of people who will fall into each group can be calculated by 
forming products in this manner if we know the frequencies of Yes’s 
and No’s for each item separately. 

Let us assume that the first question has 20 per cent ‘“Yes” and 
80 per cent “No,” the second has 40 per cent “Yes” and 60 per cent 
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“No,” the third question 60 per cent ‘‘Yes” and 40 per cent “No,” 
and the fourth question 80 per cent “‘Yes’’ and 20 per cent “No.” 
Given further that the four items are statistically independent (and 
hence nonscalable), we can calculate the frequency for each of the 
sixteen possible kinds of combinations of responses to the four ques- 
tions, perform a scalogram analysis on these, and wind up with the 


TABLE 1 


Four Inpprmnpgnt Dicnoromizs 


Category 
Group oo Frequency 
VPE Yes, Yen Yeu Yeu Noa No No Nau 
1 4 x x x x .0384 
2 4 x x x x .0096 
3 4 x x x x .0256 
4 3 x x x x 1536 
5 3 x x x x 0384 
6 3 x x x x -1024 
7 2 x x x x 2304 
8 2 x x x x 0576 
9 2 x x x x .0576 
10 2 x x x x 0144 
11 1 x x x x -1536 
12 1 x x x x 0384 
13 0 x x x x .0384 
14 0 x x x x .0096 
15 0 x x x x .0064 
16 0 x x x x .0256 
Marginal 
frequency 20 40 -60 80 80 .60 40 20 1.000 
Error .1264 .0320 — _ _ —  .1280 .1200 
Nonerror .0736 .3680 .6000 .8000 .8000 .6000 .2720 .0800 


scalogram arrangement shown in Table 1. The scalogram picture 
in Figure 1 will show more clearly the distribution of error. In 
Table 1, the z’s indicate the responses, but the frequency is shown 
in the right-hand column, since we have shown but one z for each 
group instead of repeating enough z’s to represent the proportion of 
people in the group. Figure 1 shows the frequencies of each type 
graphically, so that error can be estimated visually. Continuous 
line segments are used in Figure 1 instead of the usual columns of 
dots; the length of each segment shows the per cent of cases in the 
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category at the given rank intervals. Scale types are defined as in 
Chapter 4, according to the criterion of least number of errors. 
The reproducibility of this sample of four items is .90. However, 
we would not infer therefore that the universe from which these four 
items are drawn has a reproducibility of .90; the second criterion of 
improvement is not satisfied. At least two of the items have spuri- 
ously high reproducibility according to our second criterion. The 
first and fourth questions, because of their marginal frequencies, 


Yes, Yes, Yes, Yes, No, No, No, No, 
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Figure 1. Four independent dichotomies. 


PROBLEMS OF RELIABILITY $91 


cannot have more than 20 per cent error regardless of scalability 
because each has 80 per cent of the cases in its modal category. 
The reproducibility of these items in the sample is higher than 80 
per cent because the scale arrangement is based partly on the items, 
so that there is some improvement over 80 per cent since the item 
itself is used in a partially circular fashion. Actually, the first item 
has 13 per cent error in its first category, which is more error than 
nonerror for this nonmodal category. Similarly, the fourth ques- 
tion has 12 per cent error in its “No” category, which is also more 
than half the frequency of its nonmodal category. Neither of these 
items has had its error cut down by at least half, so we conclude that 
their reproducibility is spuriously high. 

Since two of the four questions, or half the sample of items, are 
judged to have spuriously high reproducibility, we reject the hy- 
pothesis that the universe from which they are drawn is scalable. 

It is clear that if we had used more than four independent items, 
we would have been all the more certain to reject the hypothesis of 
scalability. The more items in the sample, the more possible is it 
for error to appear if it is present in the universe of items. 

Anempirical example. If the items are not independent but have 
positive intercorrelations, they still need not be scalable. Using 
about a dozen items has been found empirically to be rather satis- 
factory in distinguishing between cases which are sufficiently scal- 
able and those which are not. Asan example, the following mixture 
of thirteen items was subjected to a scalogram analysis: 


27. How satisfied are you about being in your present Army job instead 
of some other Army job? (Check one) 

Very satisfied 

—_—_— Satisfied 

It does not make any difference to me 

—_—— Dissatisfied 

Very dissatisfied 

34, After the war do you think you would like to come back to your old 
outfit for two weeks of training every year? (Check one) 

Yes, I would want to very much 

—_—— Yes, I might want to, but I’m not sure 

No, I don’t think I would like to 

No, I would not want to at all 

Undecided 


36. If you were sent into actual fighting right now, how do you think you 
would do? (Check one) 

I think I would do all right 

I think I would have trouble at first, but after a while I 

would be O.K. 
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——_—— I don’t think I would do very well 
I haven’t any idea how I would do 


Do you think that after the war soldiers in your outfit will find it 
easier or harder to get jobs than they did before the war? (Check 
one) 


Easier than before the war 
—__— About the same as before the war 
___. Harder than before the war 
Undecided 


Do you think it is at all likely that we will fight England in the next 
25 years or so? (Check one) 

Not possible 

—_— Not very likely 

—__— Quite likely 

Almost certain 


. How much do you think the United States is doing to win the war? 


(Check one) 

Doing al it possibly can 
Doing all it can in most things 
Could do quite a bit more 
Could do a whole lot more 


. How do you think this war will come out? (Check one) 


It will be a complete victory for us 

———— We will win, but it won’t be a complete victory 
——— It will end in a draw 

The other side has a pretty good chance of winning 
Undecided 


Do the men in your company (battery, squadron, troop) co-operate 
and work well together? (Check one) 

All of the time 

___— Most of the time 

____— Often do not 

__— Almost never do 

Undecided 


Do your noncoms give you a good chance to ask questions as to the 
reason why things are done the way they are? (Check one) 

Always 

—___—. Usually 

Not very often 

——_____ Almost never 

Undecided 


How many of your company officers are the kind who are willing to go 
through anything they ask their men to go through? (Check one) 
All of them are 

——__— Most of them are 

——_— About half of them are 

Few of them are 

None of them are 
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89. Do you think the equipment of the American Army is better or worse 
than the equipment of the German Army? (Check one) 
All of our equipment is better 
Most of our equipment is better 
—___— Our equipment is about the same as that of the German 
Arm 


y 
——_—— Most of our equipment is worse 
_____— All of our equipment is worse 
Don’t know 


92. In general, how serious an offense do you think it is for a solider to go 
AWOL? (Check one) 
—____ Very serious 
Pretty serious 
Not so serious 
Not serious at all 
Undecided 
It depends on the condition 


96. Up to now, has the war made things better or worse for your family 
and friends back home (such as jobs, pay, and living conditions)? 
(Check one) 

Things are a lot better 

____— Things are a little better, but not much 

——__— I don’t see any difference 

____— Things are a little worse 

—___—. Things are a lot worse 

Don’t know 


These questions are not drawn from any one universe of content 
and are not expected to be scalable, although it is known there are 
positive correlations between the various kinds of content involved 
in this mixture. The reproducibility is only .82. Furthermore, 
practically every question would be suspect on the basis of the cri- 
terion of improvement. This result is obtained here even though 
the questions were not originally dichotomies so that there was more 
room for sampling variation and for the appearance of spuriously 
high reproducibility through the combination of categories. 

An example of trichotomies. The hypothesis of scalability seems 
even more tenable if high reproducibility and proper improvement 
are obtained when three or more categories can be retained in at 
least some of the items. For example, on the basis of only three 
dichotomies it will usually be hard to reject the hypothesis of scal- 
ability even though it is false, but it is easier to reject the hypothesis 
if the items are trichotomous. In Table 2 and Figure 2 are shown 
a scalogram analysis of three independent trichotomies. The items 
are indicated by the numbers 1, 2, 3, and the categories by the let- 
ters a, b, c. The twenty-seven possible combinations of answers 
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have their relative frequencies computed, assuming that the three 
items have the marginals indicated and are independent. The 
reproducibility is .84, and each of the three items has a category 
with more error than nonerror. It is clear, then, that if items can 
remain in trichotomous form in a scalogram analysis, it 1s more plausi- 
ble that the universe is scalable than if they have to be combined. It 
is easier for error to appear—if it actually should be present—when 


TABLE 2 


Taurens INDEPENDENT TRICHOTOMIES 


Category 
Group pie Freq. 
ype la 2a S80 1b 2 Sb tc & S$ 
1 6 x x x .006 
2 6 x x x .012 
3 6 x x x .002 
4 6 x x x .006 
5 5 x x x .036 
6 5 x x x .012 
7 5 x x x .018 
8 4 x x x 108 
9 4 x x x 018 
10 4 x x x 036 
ll 3 x x x .216 
12 3 x x x 036 
13 3 x x x .036 
14 3 x x x .072 
15 3 x x x 072 
16 2 x x x .006 
17 2 x x x .108 
18 2 x x x .036 
19 2 x x x .054 
20 2 x x x .018 
21 1 x x x .012 
22 1 x x x .036 
23 0 x x x .018 
24 0 x x x .006 
25 0 x x x -006 
26 0 x x x 002 
27 0 x x x 012 
Marginal 
frequency 10 .20 320 .60 .60 .60 80 .20~ .10 
Error 074 .114 .072 .012 .000 .012 018 .114 .074 


Nonerror .026 .086 .228 .588 .600 .688 .282 .086 .026 
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more categories are used; if error does not appear in this case, it is 
more plausible to assume that there was not much error really to be 
expected in the population. 

The reliability of scale ranks. Thus far we have discussed the 
sampling of items with respect to reproducibility. A related aspect 
is the stability of the ranking of people. To what extent does the 
rank order of people on a sample of items correspond to the rank 
order that would be obtained on the universe of items? 
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Figure 2. Three independent trichotomies. 
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In one sense, the correspondence can never be perfect between 
sample and universe ranks even for the case of a perfect scale. For 
example, a sample of ten dichotomous items from a perfect scale 
can yield at most eleven distinct ranks. In the universe of items, 
there may be an infinite number of distinct ranks. The sample 
ranks, therefore, represent groupings or class intervals of the uni- 
verse ranks, and there would not be a one-to-one correspondence 
between sample ranks and universe ranks. The sample ranks will 
be a simple function of the universe ranks, but not vice versa. Two 
people can have the same sample rank but differ somewhat in their 
universe rank. Moreover, two people with different sample ranks 
must also have different universe ranks. One person who is higher 
than another in the sample must be higher than the other in the uni- 
verse, if the universe is perfectly scalable, whereas two people with 
the same sample rank may be either in the same or in adjacent ranks 
in the universe. 

In practice, even for a perfectly scalable universe, it would not 
usually be desirable to work with a large number of ranks. In much 
of attitude and public opinion research it is sufficient to work with 
deciles or quintiles or even grosser classifications. The infinite 
number of ranks in the universe, therefore, is not needed; the ranks 
obtainable from a sample can be quite sufficient for the purpose of 
the research. If finer rankings are desired like percentiles, then of 
course a larger sample of questions is needed. For the perfectly 
scalable universe, then, the number of questions to use in a sample 
depends upon the number of ranks between which it is desired to 
differentiate for the given research problem. 

For the case of an imperfectly scalable universe, which is the usual 
case in practice, we have to worry about not only the ultimate num- 
ber of ranks desired for a particular research project, but also the 
reliability with which an individual is placed into his proper rank on 
the basis of the sample. The unreliability we are concerned with 
here is that due to the sampling of items. Just like the coefficient 
of reproducibility, there can be no sampling variation in the fact 
that the sample ranks must be a simple function of the universe 
ranks for the case of a perfectly scalable universe, and the possibility 
for unreliability in sample ranks increases as the reproducibility of 
the universe goes down. Requiring that reproducibility be about 
.90 or higher means that as a consequence we obtain very substantial 
reliability for the ranking of individuals. 

Relationship between part score and whole score. As an empirical 
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example, let us consider the area of attitude toward officers. A 
sample of eleven questions was used in this area, each of which was 
dichotomized in the course of the scalogram analysis. Thus, total 
scores in the sample of eleven questions ranged from 0 to 11. The 
sample was broken into two parts, one of five questions and the other 
of six questions, by arranging the questions in order of the frequency 
of “favorable” responses and selecting every other question. The 
subsample of five questions we shall call the “first half,” and the 
subsample of six questions we shall call the ‘‘second half” of 
the sample. 

The final survey for these items was made on a cross section of 
three thousand enlisted men. The correlation between the scores 


TABLE 3 
Tae ConRELaTION BerwmEn HaLF AND WHOLE Scorzs 


Whole Score on firat half 
score 0 1 2 8 4 5 Total 
11 01 01 
10 01 .03 04 
9 04 .02 .06 
8 -02 .06 02 10 
7 .05 .05 10 
6 02 -08 02 12 
5 .06 06 12 
4 .02 07 .03 12 
3 -06 04 10 
2 01 -08 02 ll 
1 03 .05 .08 
0 04 .04 
Total .08 21 .21 24 18 .08 1,00 


on the first half and the total scores is shown in Table 3. Table 3, 
of course, does not show what we are actually interested in. What 
we would like to know is what the relationship is between scores on 
a sample of questions and scores on the entire universe. Since we 
cannot study an entire universe in practice, what we are doing here 
is showing relationship between the scores on a subsample of ques- 
tions and scores on the total sample. 

The relationship in Table 3, of course, may be somewhat spuri- 
ously high because the subsample is included in the total score. On 
the other hand, it should be remembered that for discrete data with 
limited possible intervals such as we have in this sample, there is a 
maximum possible correlation for given marginal frequencies. Given 
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that the scores on the first half and that the total scores have the re- 
spective marginal frequency distributions given in Table 3, the best 
possible correlation table that they could have is shown in Table 4. 
Table 4 is what the joint distribution of these two scores would be if 
both were simple functions of the same quantitative variable. It is 
impossible for the part score and total score to be perfectly related 
because their marginal frequencies are so different. At best, both 
the scores can be perfect functions of a third score which, in this 
case, would presumably be the rank on the universe of items. 

If we take the scale ranks assigned to the first half and to the 
total at their face value as actual numbers, and if we use the ordi- 


TABLE 4 


Maximum PosstsLE CoRRELATION oF First Har wirh WHOLE Score 
For Frxmep MARGINAL DisTRIBUTIONS 


Whole Score on firet half 
score 0 1 g 3 5 5 Total 
11 01 01 
10 ‘04 ‘04 
9 03 ‘03 06 
8 10 10 
7 05 05 10 
6 12 12 
5 05 ‘07 12 
4 12 12 
3 06 ‘04 10 
2 at YW 
1 04 ‘04 ‘08 
0 04 04 
Total 08 21 21 24 18 08 1.00 


nary product-moment coefficient of correlation, we find the corre- 
lation between the first half and the total score in Table 3 to be .93, 
and the maximum possible correlation in Table 4 to be .97. 
Relationship between two half scores. Another way of looking at 
the matter of reliability of ranks which will eliminate the possible 
element of spuriousness in Table 3 is simply to correlate scores on 
the first half with scores on the second half. This does eliminate 
the possibility of spuriousness but, on the other hand, is a bit more 
removed from the direct problem of finding out the correspondence 
between the sample rank and the universe rank. What we shall be 
doing here is to investigate the correspondence between rank on one 
sample and rank on another sample. The table for the correlation 
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between scores on the first and second halves of the sample is given 
in Table 5. 

Again, since the marginal distributions of the two half samples 
are different, their scores cannot possibly have a perfect correlation. 


TABLE 5 


CORRELATION BETWEEN Scorns ON THE Two Har SAMPLES 


Score on Score on first half Total 
second half 0 1 2 8 4 6 

6 01 OL 02 

5 02 04 .03 09 

4 02 .05 06 02 15 

3 02 .06 .08 05 02 23 

2 01 .06 07 .06 02 22 

1 03 .08 04 03 18 

0 04 05 02 11 
Total 08 21 21 .24 18 .08 1.00 


The maximum possible correlation between the two halves, given 
the marginals of Table 5, is shown in Table 6. At best, the two half 
sample scores can be simple functions of a third variable, the uni- 
verse ranks, in which case their joint occurrence is that shown in 
Table 6. Treating the half scores as though they were actual num- 


TABLE 6 
Maximum PossisLEt CoRRELATION BETWEEN THE Two Ha.tF 
SamMpyes FoR Fixep MARGINAL DISTRIBUTIONS 


Score on Score on first half 
second half 0 1 2 3 4 aoa 


Qn 


03 08 ‘09 


23 23 

21 01 22 

18 18 

.08 03 11 


Total .08 21 21 24 .18 08 1.00 


Ce NWR aAID 


bers and computing the ordinary product-moment coefficient be- 
tween them yields a correlation coefficient of .74 for Table 5 and 
.98 for the maximum possible in Table 6. 

Increased reliability with number of items. If the errors of repro- 
ducibility behave at all like random errors of measurement, we 
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should expect that, as the number of items increases, then the rela- 
tionship will become closer between the ranks obtained from two 
separate samples. We would expect a much higher relationship be- 
tween the total scores on the present eleven items and the total 
scores on another eleven items from the same approximately scalable 
universe than we find in Table 5 between samples half the size. 
We would expect this, not because it is necessarily true that relia- 
bility of scores (with respect to samples of items) automatically 
must increase with the number of items, but because we are dealing 
here with a special case of an approximately scalable universe. It 
is not true in general for nonscalable universes that adding more 
items to a sample will increase stability of scores with respect to 
sampling of items. It is because we are working with a particularly 
simple kind of universe which has but one dimension, apart from 
scale error, that the proposition is true about increased reliability 
with respect to the sampling of items as the number of items in- 
creases. 

It would not be safe to employ anything like the Spearman-Brown 
prophecy formula often used by psychologists in this connection, 
because we know that the assumptions behind such a formula in 
general cannot be fulfilled by samples even from an approximate 
scale. Even if there were no scale error, sample ranks will in gen- 
eral not be perfectly correlated with each other. They would be 
simple functions of the universe scale scores, but that is quite dif- 
ferent from being the perfect functions of each other which is re- 
quired by the Spearman-Brown formula. 

How many items to be used in the final study? If a scalogram anal- 
ysis of a pretest of a dozen or so items shows reproducibility to be 
about .90 or more, and if the criterion of improvement is also satis- 
fied, then it seems safe to infer that the ranking of people obtained 
from the sample of items is a good approximation to the ranking 
that would have been obtained from the universe (with respect to 
the number of ranks desired). If the items are dichotomies, this 
means that about a dozen ranks are ascertainable by means of the 
sample. If there are more than two categories retained in some or 
all questions, then even more ranks can be determined by the sam- 
ple. 

In many studies it is desirable to use only four or five ranks or even 
less, since only gross comparisons between groups are intended to 
be made. For example, in the accompanying volumes on the con- 
tent of the Research Branch’s work, the population is divided very 


PROBLEMS OF RELIABILITY 801 


often only into thirds. In such a case, all the items of the pretest 
need not be retained for the final study, but only enough items to 
provide the desired ranks can be used. Since error is present, it is 
better to use more items than would have been used if no error were 
present. For example, if it is desired to divide a population into 
thirds, perhaps five dichotomies might be used to obtain six ranks, 
and then these ranks can be combined into three ranks with assur- 
ance that this will be a fair representation of the ranking in the uni- 
verse of items. It is doubtful whether even as many as a dozen 
questions are necessary in the final study for many problems which 
intend to use the ranks as rough quantitative numbers by treating 
them as though they were actual integers. In the example just dis- 
cussed in Table 3, the correlation between scores on five items is .93 
with the scores obtained on eleven items. For practical purposes, 
then, these five items could be used instead of the eleven almost in 
any situation where it was originally intended to use the eleven. 

This is one of the powerful features of the scale pattern. Rela- 
tively few items can be used in practice to represent the entire uni- 
verse of items, with assurance that the predictive power of the uni- 
verse is being maintained. Adding more items to the sample will 
not in general enhance the correlations of the scale scores with any 
outside variable to any appreciable extent. 

Reliability of the intensity function. It is the reliability of scale 
ranks with respect to sampling of items that determines the invari- 
ance of the intensity function discussed in Chapter 7. If content 
scale ranks on one sample of items are closely related to content 
scale ranks on another sample of items, and if intensity scores on the 
first sample are also closely related to intensity scores on the second, 
then the scattergram relating intensity to content cannot be very 
dissimilar between the two samples. When the data are plotted in 
the percentile metric, the zero points must approximately coincide, 
and the entire empirical intensity curves cannot deviate substan- 
tially from each other. 

Since intensity as obtained by present techniques is a quasi scale, 
it needs more items in a sample to ensure sufficient stability with 
respect to the universe of items. Furthermore, since current in- 
tensity scores are not pure functions of the content, the shape of the 
empirical intensity curve may not be reliable with respect to sam- 
pling of people unless a large sample is drawn from the population. 
If there were no error in the relation between empirical intensity 
and content, there would be no room for error with respect to the 
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sampling of people; the median intensity for each content score 
would have no dispersion. In practice, since there may be con- 
siderable spread about the columnar median intensities in the 
intensity-content scattergram, the sample medians can have corre- 
spondingly large sampling error with respect to people unless the 
large sample of the final study is used. In particular, the rank 
containing the interval with minimum intensity may not be safely 
determinable from only a small sample of people. 

Again, even if this rank containing the zero interval could be re- 
liably ascertained on the basis of a pretest, the proportions of ‘“posi- 
tive” and “negative” people in the population would still have the 
ordinary sampling distribution of a proportion. The usual num- 
ber of people used in a final opinion survey is needed to determine 
these proportions reliably. 

Intensity, as obtained by current techniques, in general cannot be 
studied very adequately only in the pretest. 


Test-Retest Reliability 


Variability within the respondent. The third problem of reliability 
with which we are concerned might be called that of variability 
within the individual respondent. If he were asked the same ques- 
tion over and over again under the same conditions—without carry- 
over from experiment to experiment—how much variation would 
there be in his responses? This problem of test-retest reliability is 
the usual one of errors of observation, but it has often been confused 
with the problem of sampling of items. It is important to keep 
these two reliability problems separate and distinct, for each re- 
quires a different analysis. 

It is only very recently that the theory of the test-retest relia- 
bility of qualitative data has been developed.*? A person’s response 
to a dichotomous question, for example, is to be called completely 
unreliable, or said to have zero reliability, if in the series of repeated 
trials he would give one response half the time and the other re- 
sponse the other half of the time. Crudely speaking, it might be 
said in such a case that the person’s response does not differ from 
what would be expected by “chance.” If the person does have a 
tendency to pick one category over the other, that is, if he chooses 
one category more than half the time, then he is said to have some 
reliability, or reliability greater than zero. If he always chooses the 


* Louis Guttman, “The Test-Retest Reliability of Qualitative Data,” Psychometrika, 
Vol. 11, No. 2 (June 1946), pp. 81-95. 
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same category in all the trials so that there is no variation in his re- 
sponse from trial to trial, he is said to have perfect reliability or 
reliability of unity. A coefficient to express the various degrees of 
reliability as numbers between zero and unity is based on the fre- 
quency of the category which the individual would choose most 
often in a series of trials. The category he would choose most often 
is called his modal category, and the relative frequency of this cate- 
gory is called his modal probability. 

The same concepts hold for items with more than two categories. 
In the case of a dichotomy, the modal probability for each person is 
always at least 14, since at least one of the categories must have a 
probability that big for him. In the case of a trichotomy, the 
modal probability for each individual is at least 4%. The individual 
will have zero reliability for a trichotomy if he appears equally often 
in all three categories during the universe of trials. If he is in one 
of the categories more than 14 of the time, then he will be said to 
have some reliability. Again, if he is in one of the categories all the 
time, he will be said to have perfect reliability. Similarly, for an 
item of four categories the modal probability for each individual is 
at least 144; the individual will have zero reliability if his modal 
probability is 44, and his reliability increases as his modal proba- 
bility exceeds 14. 

In general, if an item has m categories, the individual is said to 
have zero reliability if he is equally often in each of the m categories 
during the universe of trials; in this case his modal probability is 
1/m. If an individual’s modal probability is greater than 1/m, he 
will be said to have reliability greater than zero; if he is in one cate- 
gory all the time he will be said to have reliability of unity. 

A formula which expresses these various degrees of reliability is 
as follows. Let P; be the modal probability for the ith person in 
the population, and let R; be his reliability coefficient. The modal 
probability, P;, can never be less than 1/m and, of course, never 
greater than unity. The relationship between R; and P, is given by 
the following formula: 


ne ata(r-a) 
m-—1 m 


If P; is 3, then R, is zero. If P; is unity, then R, is also unity. R, 
1 


™ 


departs from zero as P; departs from 
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No need to perform a universe of trials. In practice, it is very 
difficult to perform a series of trials of the same individual on the 
same question without any carryover from trial to trial. Fortu- 
nately, it is not necessary to perform the series of trials in order to 
learn about the average reliability of the responses of the population 
of individuals. Only a single trial is needed in practice to provide 
information about what would happen if the universe of trials were 
to be performed. This is possible if a large population of individuals 
is observed on the single trial. 

If we let a be the arithmetic mean of the P; for the population of 
individuals, then we can deduce a minimum for @ on the basis of 
only a single trial. Accordingly, if we define p to be the arithmetic 
mean of the reliability coefficients, R;,, we can also state a minimum 
for p on the basis of a single trial. The relationship between the 
average reliability coefficient, p, and the average modal probability, 


a, is as follows: 
eee ee 2) 
m—1 m 


If a is equal to 1/m, then p equals zero; in this case each of the indi- 
vidual P, is equal to 1/m and each of the 2; equals zero. If a equals 
unity, then p also equals unity; in this case each of the P; is unity 
and each of the R; is also unity. An intermediate value of a implies 
an intermediate value of p; this is an average value for the popula- 
tion around which the individual reliabilities cluster. 

A single trial will not suffice to estimate a or p exactly, but it will 
suffice to give lower bounds or underestimates of a and p: Two 
kinds of lower bounds have been developed, the second of which is 
more powerful than the first.‘ Both of these lower bounds are very 
simple to compute. 

The marginal lower bound. The first lower bound can be estab- 
lished from the marginal frequencies of the item itself. The modal 
proportion of the item for the population of people is a lower bound 
for a. For example, in a dichotomy, if 80 per cent of the people 
are in one category and 20 per cent are in the other, then a is at 
least equal to .80. The average probability in a universe of trials 
—if it were to be performed—with which an individual would be 
found in his modal category is at least .80. Using this lower bound 
for a in the formula relating a to p, we obtain a corresponding lower 
bound to p: 


‘ Toid. 
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2 1 
p = 7(80 - 3) = .60 


The average reliability coefficient for the group thus is at least .60. 
It may be perfect; it may be .90 or anything else down to .60, but it 
is not less than .60. 

Similarly, for an item with four categories, say, if 60 per cent of 
the people are in one of the categories, then this is the modal cate- 
gory for the population; and the average modal probability for the 
individuals in the universe of trials is at least .60. The reliability 
coefficient for the group, then, has the following lower bound: 


p= $(00-3 = 47 


Lower bounds computed for the group reliability coefficient this 
way are called marginal lower bounds because they are based on the 
marginal frequencies of the distribution of the population on the item. 
In practice the marginal lower bound can grossly underestimate the 
group reliability coefficient. The second kind of lower bound is 
more efficient than the marginal lower bound. It is based on the 
correlation of the item with any other item or variable which is 
experimentally independent of it. For scale analysis, the best vari- 
able to pick is the scale score. For a discussion of the general ap- 
plication of the second kind of lower bound, the reader is referred 
to the basic article.’ We shall concern ourselves here only with 
how to bound the reliability of each item from the scalogram pat- 
tern. 

Reproducibility as a lower bound. From general considerations of 
scale theory, it should be clear that if a set of items has high repro- 
ducibility, then the items must necessarily have high test-retest 
reliability. If there were a substantial unreliability factor oper- 
ating in the responses to the items, this would create appreciable 
scale error ; there would be more than a single factor present. Hence, 
if scalogram analysis shows that essentially only a single factor is 
operating in the responses, this must mean that there cannot be 
many additional factors, including unreliability. This can now be 
shown explicitly in terms of the second kind of lower bound to the 
reliability coefficient. 

If all items were equally reproducible, then their common repro- 
ducibility would be the reproducibility coefficient for the set of items. 


5 Ibid. 


3806 PROBLEMS OF RELIABILITY 


If, furthermore, each item were experimentally independent of the 
scale score—that is, if the variation from trial to trial in the universe 
of trials of each person’s response to the item and his scale rank were 
independent of each other—then the reliability coefficient of the set 
of items is a lower bound to the average modal probability, a, for 
each item separately. Therefore, in such a case, if r is the reproduci- 
bility coefficient, then a lower bound to the reliability of each item 
is given by: 


If the reproducibility, r, is .90 and if an item is dichotomous, then 
m = 2, and the reliability coefficient for that dichotomy is not less 
than .80: 


p = 7(.00 - 5) = .80 


If an item is trichotomous, then m = 3, and a lower bound to the 
reliability coefficient for the item is: 


If an item has four categories, then m = 4, and a lower bound to 
the reliability coefficient for such an item is: 


pz +(.00 = 7) = 867 


If an item has five categories, then m = 5, and a lower bound for 
pis: 


p2 (90 = 3) = .875 


Thus we have an important relationship between the coefficient 
of reproducibility and the test-retest reliability of each item in the 
set being analyzed. 

It may be noticed that, for fixed reproducibility, the lower bound 
for an item goes up as the number of categories goes up. But the 
increment becomes progressively smaller, so that there is not much 
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difference between the lower bounds of items with four and five 
categories if they have the same reproducibility. 

It is important to remember that only lower bounds to the test- 
retest reliability of the items are being determined this way. Upper 
bounds cannot be established from only one trial. At least two 
experimentally independent trials are required to set an upper 
bound.* An item may have perfect test-retest reliability for each 
individual in the population and still have low reproducibility. The 
property of reproducibility refers to the structure of the universe of 
items. The property of test-retest reliability refers to the universe 
of trials of the same individuals on the same item. These are quite 
distinct concepts and are related only by the inequalities we have 
just been working with. 

The assumptions involved. A basic assumption in the second kind 
of lower bound is that the item be experimentally independent of the 
score it is being related to. This will ordinarily not hold if very few 
items, say only five or six, are analyzed by the scalogram technique. 
The item itself contributes to the total score and therefore, in 
general, will have its error of unreliability related to the error of 
unreliability of the total score. The proper way of using reproduci- 
bility as a lower bound for test-retest reliability would be to com- 
pute scale scores for individuals on all the items in the sample except 
for the one whose test-retest reliability is being studied for the mo- 
ment. Then the errors of observation can be experimentally inde- 
pendent. The possible spuriousness of keeping the item in the total 
score does not seem to be too serious when a dozen or so items are 
used, if the reproducibility of the whole set is about .90; as we have 
seen with respect to the sampling of items, it is clear that in such a 
case scores based on ten items cannot differ very much from those 
based on eleven items. Since reproducibility is a lower bound any- 
how, overestimating the lower bound spuriously by keeping the 
item in the total score should still yield a lower bound to the relia- 
bility coefficient itself, since rarely would we expect the reliability 
coefficient to be equal to any particular lower bound obtained in 
practice. 

If the items in the set do not have equal reproducibility, then 
using the overall coefficient of reproducibility as a lower bound does 
not actually yield a lower bound for each item separately. Instead, 
it yields a lower bound to the average reliability coefficients of the 
items. If it is desired to establish a lower bound for each item sep- 


* Ibid. 
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arately, then the actual reproducibility of the item should be used 
in the formula for lower bound, instead of the overall reproducibility 
coefficient for the set. 

The test-retest reliability of scale scores. Thus far we have investi- 
gated the test-retest reliability of responses to separate items. We 
have seen that this is necessarily very high for approximately scal- 
able items. As a consequence, it should follow that test-retest 
reliability of scores derived from these items should be very sub- 
stantial. Again, it should be remembered that we are concerned 
here with a universe of trials of the same people on the same items; 
we are not concerned with the problem of sampling of items—that 
was discussed in a previous part of this chapter. Our problem now 
is: how much variation will there be in a person’s scale score if he 
were to be given the same items over and over again—without carry- 
over from trial to trial? This is different from the problem previ- 
ously discussed of how much variation there will be between a per- 
son’s score obtained on one sample of items and that obtained on 
another sample of items. 

The basic theory and working formulas for this problem of test- 
retest reliability of scores have been developed recently elsewhere.’ 
The problem of quantitative scores is a bit different from that of 
qualitative items because of a different definition of error. For a 
qualitative item, we predict the category a person is in either cor- 
rectly or incorrectly, and amount of error is defined in terms of the 
total number of incorrect predictions. For quantitative variables, 
predictions are almost always incorrect; they are rarely exact. But 
small errors are of no concern; it is the large errors that are impor- 
tant. Hence, error for quantitative variables is measured in terms 
of the size of errors rather than the number of errors. In the quali- 
tative case, we dealt with modal categories and measured error in 
terms of frequencies. In the quantitative case, we shall use arith- 
Inetic means and measure error in terms of variances. 

On each trial in the universe of trials, each person would obtain a 
total score on the given items. Let e,? be the variance over the 
trials for the ith person’s total score. His score will be said to be 
completely reliable if his error variance, e,*, is zero. The greater 
e,;? is, the more unreliable will the 7th person’s score be. The prob- 
lem of test-retest reliability of scores is to obtain information about 
e;? for each person. To do this exactly would require an actual 


7 Louis Guttman, “A Basis for Analyzing Test-Retest Reliability,” Psychometrika, 
Vol. 10, No. 4 (December 1945), pp. 255-282. 
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series of independent repetitions of the experiment. In practice, it 
is difficult, if not impossible, to conduct repeated trials without hav- 
ing carryover from trial to trial or else a change in the conditions. 
It is fortunate that only a single trial 1s needed to provide useful in- 
formation about the average test-retest reliability of the scores for 
the population of individuals. 

The reliability coefficient for scores. Let e? be the arithmetic mean 
of the e,;? for the population. If e? is zero, then the population scores 
will be said to have perfect test-retest reliability; in this case each 
of the e,;*? is zero (each individual has zero error variance). The 
larger e? is, the larger are the individual error variances on the aver- 
age. On the basis of a single trial, it is possible to set an upper 
bound to e?. 

Instead of working with e?, however, it is conventional to convert 
the error variance into a reliability coefficient which varies between 
zero and unity. For the problem of total scores from a set of items, 
this coefficient is defined to be? 


where g;,? is the variance of the total scores over the population of 
individuals and over the universe of trials. It can be proved that ¢* 
can never be bigger than o,”. If e is zero, then the reliability co- 
efficient, p,, is unity ; this is the case of perfect test-retest reliability. 
If ¢? is equal to o,? (which is its maximum), then the coefficient, 
Pp”, is zero, which is called the case of perfect unreliability. Inter- 
mediate values of ¢? yield intermediate values of p,?. 

The split-half lower bound. On the basis of only a single trial it 
is possible to set a lower bound to the reliability coefficient, p,? 
(this is equivalent to setting an upper bound to e”). Six different 
lower bounds have been developed in the original paper on this sub- 
ject,® some of which are better than others under certain conditions. 
Perhaps the formula which will give the highest lower bound for 
scalogram analysis is the split-half lower bound, designated by Iu. 
(It is the fourth in the series of six bounds.) The computations for 
the lower bound, Jy, are very simple and will be illustrated for the 
data on attitude toward officers used in Tables 3 and 5. To com- 
pute J, it is required that the sample be divided into two halves so 


8 Ibid., p. 262f. 
* Ibid. 
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that there are two half scores available, as well as the whole score 
which is the sum of the two half scores. All that is needed, then, 
is to compute the variances among people of these three sets of scores 
on a single trial. Let s,? be the variance of the scores in the first 
half of the test; let s,? be the variance of the scores in the second 
half of the test; and let s,? be the variance of the total score. Then 
the formula for the lower bound is: 


=2(,—- sts 
Ly = 2(1 A ) 


The variances of the two half scores can be computed from the 
marginals of Table 5. For the five questions in the first half of the 
sample, a, = 1.99. For the six questions in the second half, 
s,? = 2.35. The distribution of total scores is given in the mar- 
ginals of Table 3, whence we ascertain that s,? = 7.53. Substi- 
tuting these three variance values into the formula for I,, we ob- 
tain: 


_ 1.99 + 235) 2 ae 


=32 (1 7.53 


This means that p,’, or the reliability coefficient for the total score 
on the eleven items for the population of soldiers, is not less than 
.85. It may be anything between .85 and unity. 

This split-half lower bound, Ly, will tend to be higher according as 
the correlation between the two half scores increases. Therefore, 
it is desirable to split the sample into two halves in such a way as 
to maximize the correlation between the two halves. For a highly 
reproducible set of items it is clear that two halves can usually be 
obtained whicli will have a very substantial intercorrelation, ac- 
cording to our discussion of the problem of sampling of items. The 
halves should be chosen in such a way as to make their marginal 
distributions as similar as possible, because, as we have seen previ- 
ously, two half scores cannot correlate very highly if they have 
widely different marginals. By pairing off items with similar fre- 
quency distributions, and using one from each pair in each half of 
the sample, it will usually be possible to obtain part scores which 
have similar distributions and which will have a rather high inter- 
correlation. This is what was done for the present example on 
attitude toward officers. As a result, the lower bound to the test- 
retest reliability coefficient will be high. 
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The split-half lower bound, Z,4, resembles somewhat the tradi- 
tional Spearman split-half formula, corrected for test length. In- 
deed, it will be numerically equivalent to Spearman’s formula if 
8.” equals 8,2. However, it avoids the stringent assumptions be- 
hind the Spearman formula, and indeed proves that if Spearman’s 
assumptions are not satisfied by the data, then his formula in 
general underestimates the test-retest reliability coefficient. The 
lower bound, Z,, assumes only that the two half sample scores are 
experimentally independent of each other and that the computa- 
tions are based on a large population of individuals. It does not 
assume that the two halves belong to the same scale or anything 
else of the kind. If the two halves happen to be parts of the same 
scale, then LZ, will prove to be high. If the two parts do not hap- 
pen to be scalable together, then LZ, may be low. A low Ly does 
not imply that the two halves—or their sum, the total score—have 
low test-retest reliability. The test-retest reliability of the scores 
may be perfect, yet L, can be zero. Unreliability is but one possible 
reason why variables are uncorrelated. They may be uncorrelated 
even though they are perfectly reliable. 

For the case of the scale scores then, as well as for the qualitative 
responses to the separate items, we have assurance that if the items 
are approximately scalable, then they necessarily have very sub- 
stantial test-retest reliability. Scalogram analysis provides as an 
automatic byproduct the assurance that responses to individual 
items and total scores both have relatively little error of measure- 
ment if the reproducibility is high. 


CHAPTER 9 


THE PRINCIPAL COMPONENTS 
OF SCALE ANALYSIS’ 


he problem of metric. If a set of items conforms to the perfect 

scale pattern for a population of individuals, then there is a 
meaningful rank order on those items for the individuals. The 
rank metric is all that is necessary for one important aspect of ex- 
ternal prediction. Any outside variable can be predicted as well 
from the rank order as it could be from any other scores into which 
the ranks might be transformed. Therefore, for those problems in 
which the scale is to be used only as a predictor, there is no need to 
seek any metric other than the rank order. 

On the other hand, if the scale is to be predicted from external 
variables, and if the prediction is not perfect, then there is some 
need to consider the problem of metric. The prediction process 
must minimize errors in some sense. Different metrics will define 
error differently, so that different metrics may require different 
prediction processes. 

The problem of metric can also arise for internal as well as for ex- 
ternal problems. One such internal problem that we have already 
faced is that of defining a zero point for an attitude. There may 
be other internal problems arising in the future, and for these it may 
be desirable to have an intrinsic metric defined for a scale. 

The equations of scale analysis to be presented in this chapter 
show what internal metric, apart from rank order, is involved in the 
scale pattern. The metric depends on two sets of parameters for 
the scale: the frequencies of the scale types of persons and the fre- 
quencies of the types of scale items. Different frequency distribu- 
tions give rise to different metrics for scale scores. It is interesting 
to note that rank order turns out to be a metric arising from a cer- 
tain pair of frequency distributions. 

If we had knowledge of the frequencies of the types of items and 


1 By Louis Guttman. 
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types of persons, then we would know how to transform the scale 
ranks into the proper internal metric of the scale, if rank order was 
itself not the internal metric. Such knowledge seems impossible a 
priori because of the manner in which items are obtained. Any 
empirically obtained frequencies are functions of the particular 
questions constructed by the research worker, and are subject to the 
vexing difficulties concerning sampling the universe of content 
(see Chapter 8). 

Among the most interesting features of the equations of scale 
analysis are the new approach they suggest to solving the problem 
of sampling of items and the new light they throw on how to inter- 
pret the content of scales. These are unexpected results that have 
come out of the quest for an internal metric. The work on the in- 
tensity function (see Chapter 7) was one of these results; this work 
was a direct outcome of the mathematical analysis of the scale pat- 
tern. The intensity function is but one of infinitely many functions 
inherent in a scale; it is the first and only one to date to be identi- 
fied empirically.2, These functions are called principal components. 

Let us see how a scale pattern has inherent in it the intensity and 
other principal components, and how these relate to the problem of 
metric. 


Description and Illustration of the Scale Equations 


For the nonmathematical reader. The notion of principal com- 
ponents is an important one in mathematics and in the physical 
sciences. It has occurred also in the field of factor analysis for 
mental testing.* In the present chapter we shall show how it 


* Added in proof: The writer has now discovered empirically the next function, or 
third component of a scale, in his work for the Israel Institute of Public Opinion Re- 
search, as reported at the Third Congress of the World Association of Public Opinion 
Research at Paris in September 1949. This report will be published soon. 

The third component is called closure, which seems to be a scalable attitude. When 
related to the content scale of any one attitude of the larger set, it yields a curve with 
two bends, one on each side of the zero point of content. This can be illustrated from 
a study of attitudes toward agriculture in Israel. The bending points on definiteness 
of plans as to farming divided those favorable to agriculture into the more positive 
and the less positive; the more positive having no stronger alternative than agriculture, 
the less positive really preferring something else although liking agriculture. Similarly 
closure separated the more negative from the less negative; those with a definite alterna- 
tive being the less negative, those with no alternative (that is, unfavorable to agricul- 
ture in spite of having no alternative) being the more negative. 

* See L. L. Thurstone, The Vectors of Mind (University of Chicago Press, Chicago, 
1935), especially Chapter 4. The most extensive investigation of this problem for 
factor analysis has been made in Harold Hotelling, ‘“‘Analysis of a Complex of Statistical 
Variables into Principal Components,” Journal of Educational Psychology, Vol. 24, Nos. 
6, 7 (September, October 1933), pp. 417-441, 498-520. 
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emerges naturally, in a special form, for the problem of quantifying 
qualitative data, with striking results for the scale pattern. 

The first half of this chapter is addressed primarily to the non- 
mathematical reader. In descriptive terms, we shall present the 
criterion of internal consistency for a metric that leads to the equa- 
tions that generate the principal components. 

For a first reading, it is suggested that the reader not pause very 
long at any one point until he has worked through all the numerical 
examples in the first half of this chapter. The actual mathematical 
derivations and proofs are reserved for the second half. 

Scores and weights are not necessary to each other. In quantifying 
a set of qualitative items, there are two different sets of numerical 
values that can be determined. One set is the weights to be assigned 
the categories of the items, and the other is the scores to be assigned 
the people. The principle we shall use for quantification, or obtain- 
ing a metric, for each of these sets is that of maximizing internal 
consistency in the sense of least squares. The appropriateness of 
this criterion for the case of the perfect scale is evidenced by the 
remarkable results that flow from it. 

The most internally consistent scores to assign the people on the 
basis of their responses to the items are those that satisfy the follow- 
ing condition. All people who fall in one category of an item 
should have scores as similar as possible among themselves, and as 
different as possible from the scores of the people in the other cate- 
gories of the item ; this should be true to the best extent possible for 
all items simultaneously. The total variance of the persons’ scores 
can be expressed as the sum of two parts: the variance of the scores 
within categories, and the variance between categories. Maximizing 
similarities within categories and differences between categories im- 
plies maximizing the ratio that the variance between categories has 
to the total variance. The square root of such a ratio is called 
technically a correlation ratio, so our problem resolves itself into 
finding the set of scores that has the largest possible correlation ratio 
with respect to all the items. 

Similarly, the most internally consistent values to assign the cate- 
gories are those that satisfy the following condition. All categories 
characterizing one person should have numerical values as similar 
as possible among themselves, and as different as possible from the 
values of the categories that do not characterize that person; this 
should be true to the best extent possible for all people simultane- 
ously. The total variance of any numerical values assigned the 
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categories can also be expressed as the sum of two parts: the vari- 
ance of the values within people, and the variance between people. 
Maximizing similarities within people and differences between 
people, then, calls for determining the set of numerical values for 
categories that will have the largest possible correlation ratio with 
respect to all the people. 

It is only to adhere to a conventional terminology, and to help 
distinguish between values for people and values for categories, 
that we shall use the term weight to refer to a numerical category 
value. 

It is important to observe that if we want only scores for people, 
we do not have to pose the problem of weights for categories. We are 
not specifying in advance that scores are to be obtained by adding 
up category weights. We specify only the criterion for the scores 
to satisfy ; how to calculate the scores will depend on what the re- 
sulting equations show. In fact, the equations for the most in- 
ternally consistent scores can be set up and solved without ever 
introducing the notion of any kind of category weights at all. In 
exactly a similar fashion, the equations for the most internally con- 
sistent weights can be set up and solved without ever introducing 
the notion of any scores at all for people. A metric for people and 
a metric for categories are two independent problems that can each 
be solved separately. 

The equations of internal consistency. The equations for these two 
problems were derived over eight years ago.‘ It was also proved 
that a simple relationship exists between the optimum scores and 
the optimum weights. The maximum possible correlation ratio for 
scores is precisely equal to the maximum possible correlation ratio 
for weights, and a reciprocity exists between the maximizing scores 
and the maximizing weights. The score of a person is proportional 
to the arithmetic mean of the weights of the categories by which he 
is characterized, and the weight of a category is proportional to the 
arithmetic mean of the scores of the people who are in it. 

Hence, if we know the optimum scores, we can calculate the opti- 
mum weights by computing arithmetic means of the scores. Con- 
versely, if we knew the optimum weights, we could calculate the 
optimum scores by computing arithmetic means of these weights. 


* Louis Guttman, “The Quantification of a Class of Attributes: A Theory and Method 
of Scale Construction,” in P. Horst, et al., The Prediction of Personal Adjustment (Social 
Science Research Council, New York, 1941), pp. 319-348. 
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Now, each person has as many category weights to average as every 
other person, because he has one and only one response to each item. 
Therefore, the arithmetic means of weights are proportional to the 
sums of the weights, differing by the constant reciprocal of the num- 
ber of items. As a consequence, the optimum scores are also di- 
rectly proportional to the sums of the optimum weights, or the scores 
can be obtained from the weights by essentially an additive process. 

The converse is not true. Weights cannot be obtained from only 
direct sums of the scores, because the number of people is not con- 
stant from category to category. Each category needs its own di- 
visor to obtain the arithmetic mean of scores. 

The equations of internal consistency, then, provide justification 
for assigning weights to items to be added up to provide scores for 
the people. And, what is more important, they show exactly what 
weights and scores to use if the frequency distributions are known 
for both the population of individuals and the universe of items. 

Scores and weights that ‘‘go in a circle.” The solution to the equa- 
tions of internal consistency can be put in the following terms. Find 
a set of scores for the people such that, if weights are derived from 
these weights by adding them up for each person, then the new 
scores will be exactly proportional to the original scores. That is, 
find a set of scores that ‘‘go in a circle”: they yield weights that 
give back the same scores. Such a set of scores is called a principal 
component® of the system. 

Or else we could start with the weights. Find a set of weights 
for the categories such that, if scores are derived by adding them up 
for each person, and if new weights are derived from these scores by 
averaging them for each category, then the new weights will be ex- 
actly proportional to the original weights. Such weights can also 
be said to “go in a circle’: they yield scores that give back the 
same weights. Such a set of weights is also called a principal com- 
ponent. 

If a set of weights goes in a circle, then the scores obtained from 
them also go in a circle. Indeed, we might say that it is the same 
circle involved in both cases. It turns out further that the correla- 
tion ratio is the same for scores as for weights if they are in the same 
circle. To use descriptive language again, we shall say that this 
common correlation ratio indicates the ‘“‘size’” of the circle. 


* Principal components are an important concept in mathematics and in physics. 
It is interesting that they should also arise out of social-psychological considerations. 
Other names used for them include: principal axes, latent vectors, characteristic func- 
tions, characteristic vectors, eigenfunctions, and eigenvectors. 
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A numerical example. Before going further into properties of 
principal components, let us consider a numerical example of what 
it means to “go in a circle.” Suppose that a universe of dichot- 
omous items has only five types of items, yielding six types of 
people. Suppose further that the types of items are all equally 
frequent, and that the types of persons are all equally frequent. 
Each type of item represents one fifth of all the items in the uni- 
verse, and at each scale rank there is one sixth of all the people. 
The scalogram is as in Table 1. 

The two categories of the first type of item are labeled la and Ib, 
and similarly for the other types of items. The types of persons are 
labeled according to their rank, from five to zero. Since each kind 


TABLE 1 


Scatyp Patrern or A UNIvensy or Five 
Kinps oF DicHoTomizs 


Type of Type of category Frequency of 
Pee la 2 Sa fa 6a 1b 2 Sb 4b 5b reeren 
5 x x x x x 5 
4 x x x x x 5 
3 x x x x x 5 
2 x x x x x 5 
1 x x x x x 5 
0 x x x x x 5 
Frequency 
of response 1 2 3 4 5 5 4 3 2 1 30 


of item has the same frequency of occurrence as any other item, and 
since each kind of person has the same frequency of occurrence as 
any other kind of person, the relative frequencies of response of 
people to categories can be ascertained by considering each z in the 
table to have the same frequency as every other 2, that is, each x 
represents 1/30 of all the responses of the population of people 
to the universe of the items. (If the types of people and/or the 
types of items did not have uniform frequencies, the z’s in Table 1 
would have correspondingly unequal frequencies.) 

For this example of uniform frequencies for both people and 

* Two items are said to be of the same type if they are perfectly correlated with each 
other. Thus, in a scale, all dichotomies with a 90-10 marginal are of one type, all 
dichotomies with an 80-20 marginal are of a second type, etc. Frequency of type of 


item, then, is quite different from—and independent of—the frequency distribution of 
reaponses to the type. 
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items, we shall verify that the rank order is essentially the intrin- 
sic least squares metric, that is, that rank order scores essentially 
“yo in a circle,’ and furthermore, have a larger circle or correlation 
ratio than any other possible set of scores. 

First, we shall shift the numbers assigned the ranks so as to make 
the mean rank zero. As will be explained later, principal com- 
ponent scores must have zero as their mean. For convenience, we 
shall use whole numbers for the translated ranks, and replace the 
numbers 5, 4, 3, 2, 1, O by the respective numbers 5, 3, 1, —1, —3, 
—5. These latter numbers we shall prove are a principal com- 
ponent of the scale. By adding 5 to each of these numbers, and 
then dividing by 2, the original rank numbers are obtained, so the 
ranks are a perfect linear function of this principal component. 

To see that the scores “‘go in a circle,” let us first ascertain the 
weights they yield for the categories. Category la has only one 
kind of person, whose new score is 5. Hence, the mean of the scores 
of this kind of person is also 5. Category 2a has two kinds of 
people, one with score 5 and the other with score 3. Since each 
type is equally frequent for this example, the mean is 4. Category 
5+3+1_ 3 

3 ’ 
and so on for the rest of the categories. We thus have the follow- 
ing table of weights, or mean scores, for the categories: 


3a has the mean score of three kinds of people: 


TABLE 2 


Catecory WeicsTs DERIVED From Metric ScorsEs 


Category la 2a 3a 4a 5a lb 2b 3b 4b = 5b 


Weight 


5 4 3 2 1 —-1 -2 -3 -—4 —5 


A check on the work is that the mean of the weights of each item, 
taking into account frequency of response, must be zero. Category 
1a has a relative response frequency of 1 and category 1b has a rela- 
tive response frequency of 5. Multiplying the category weights by 
these frequencies yields the mean weight for items of type 1: 


(5)(1) + (—1)(5) = 0 
Similarly, the mean weight for items of type 2 is: 


(4)(2) + (—2)(4) = 0 
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All the remaining items also have zero mean weights in this fashion. 
Now we have to verify that the weights in Table 2 give the metric 
scores back again. A person in the top rank of Table 1 is in the 
first five categories; these categories have the respective weights 
(Table 2) of 5, 4, 3, 2 and 1, so that the mean of the weights for 

him is: 
54+4434+241_ 


5 3 


Each type of item is equally frequent for this example, so that the 
weights occur equally often for the person and can be averaged in 
this simple fashion. Similarly, the next highest type of person in 
Table 1 is in the categories with the respective weights (Table 2) of 
4, 3, 2, 1 and —1, so his mean is: 


44+3+2+1-1_ 


5 1.8 


The mean of the weights for the third row of Table 1 is: 


$4+2e1—1=—2.. 


5 6 


By taking similar arithmetic means of weights for the remaining 
people, we obtain scores that can now be compared with the original 
scores. This is done in Table 3. The new scores just derived as 
the means of the weights (which weights were in turn derived from 
the original scores) are called here the “circular” scores. Notice 
that each circular score is precisely .6 times the original score, or the 
circular scores are strictly proportional to the original scores. Hence 


TABLE 3 


RELATION BETWEEN ORIGINAL SCORES 
AND “CrRcULAR” ScoREs 


Type of Original “Circular” 


person score score 
5 5 3.0 
4 3 1.8 


3 1 6 
2 -1 — 6 
1 -3 —-1.8 
0 -5 —3.0 
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these scores do actually go in a circle, and they constitute a principal 
component of the scale. 

It can be proved that the constant of proportionality, which in 
this case is .6, is always the square of the correlation ratio of the com- 
ponent on the items. This we shall call here the “‘size of circle.’ 
It can further be proved that .6 is the largest possible size of circle 
for our example, and that the scores we are using are the only pos- 
sible’ ones that go in this circle. They are the most internally con- 
sistent scores in the sense of least squares. 

It should be clear that scores that go in a circle are defined only 
up to a constant of proportionality. Any constant multiples of 
them will also goin acircle. This is like saying that if scores meas- 
ured in inches go in a circle, then scores measured in centimeters 
also go in a circle. For example, if the “circular” scores in Table 3 
are now used as original scores, they will also go in a circle. But 
it will be precisely the same circle as previously, for the new “cir- 
cular’’ scores will be exactly .6 times the present “‘circular’’ scores. 

If new weights are derived from the “circular” scores in Table 3, 
they will prove to be circular to the old weights of Table 2, being 
exactly .6 times the previous weights. Hence, the weights of Table 
2 also go in a circle of size .6. Any weights proportional to these 
weights will belong again exactly to this selfsame circle. 

There are many principal components for a configuration of re- 
sponses. It turns out that, for any configuration of responses to 
items, whether it is the scale pattern or not, there exist scores and 
weights that go in a circle, and hence are the most internally con- 
sistent for that configuration. Equally important is the fact that 
in general there is more than one set of scores and of weights which 
go in a circle, albeit the circles will vary in size. There are many 
such sets, in fact, usually at least as many as there are distinct types 
of items. Any such set is also called a principal component of the 
system. A set of scores that goes in a circle is a principal com- 
ponent of the score system, or of the quantification of persons’ ranks ; 
a set of weights that goes in a circle is a principal component of the 
weight system, or of the quantification of categories values. 

Each principal component is associated with a correlation ratio,® 
the square of which is the “size of circle.’ Hence, of the many 
principal components, the one with the largest correlation ratio is 


7 All sets of scores proportional to a given set are considered to be essentially the same. 
* The square of the correlation ratio is called technically a latent root of the system. 
Other names used for this are: characteristic root, characteristic value, and eigenvalue. 
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the most consistent quantification. The one with the next largest 
correlation ratio is the most consistent after the first component is 
subtracted out, etc. It is the first component, or the one with the 
largest correlation ratio, that defines our desired metric; we shall 
call this the metric component. 

Examples of further principal components. For our numerical 
example of Table 1, it can be verified that the following set of scores 
will also go in a circle: 5, —1, —4, —4, —1, and 5, assigned in this 
order to the person types. The ranks of these scores are no longer 
according to the scale rank order. Instead, the scale rank order is 
“folded over.” The persons with the highest and lowest ranks on 
the scale (types 5 and 0) have the same rank on this second com- 
ponent. Similarly, types 4 and 1 have the next highest rank; and 
the middle types 3 and 2 have the lowest rank on this second com- 
ponent. 

The size of circle now is smaller, for the reader can verify that the 
constant of proportionality for the ‘circular’ scores obtained from 
these scores is precisely .2, which is one third of the size of circle 
(.6) for the metric component. This means that the second com- 
ponent scores by themselves do not discriminate as well between 
the responses of the people to the items as do the metric component 
scores. It is easy to see that they could hardly do so in this ex- 
ample, because they implicitly combine the given six types of per- 
sons into only three types, according to the folding over of scale 
ranks. It can be proved, however, that after taking into account 
the metric component, there is none with a bigger circle than this 
second one. 

Another principal component, which has the third largest possible 
circle (after the first two are subtracted out), is the following set of 
scores: 5, —7, —4, 4, 7, —5, assigned to the types of persons in 
Table 1 in this order. The size of circle here is .1. 

The fourth most consistent principal component scores are: 1, 
—3, 2,2, —3, 1, with .06 as the size of circle. 

The fifth most consistent principal component scores are: 1, —5, 
10, —10, 5, —1, with .04 as the size of circle. It can be proved that 
this also is the smallest possible circle for our example, so that this is 
also the most inconsistent component. Any other set of numbers, 
whether a principal component or not, cannot have the square of its 
correlation ratio less than .04 for five dichotomous and scalable 
types of items which have uniform frequencies and where the types 
of persons also have uniform frequencies. 
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If we add up the five sizes of circles from the five principal com- 
ponents, we obtain unity: 


6+ .2+.1+ 06+ 04=1 


that is, the sum of the squares of the correlation ratios is unity. 
The total variation of all the responses in Table 1, expressed as 
unity, has actually been resolved into precisely the sum of five com- 
ponents. This implies that from the five component scores and 
weights, we can reproduce every person’s answer to every question 
in Table 1. How to do this will be shown a few sections below. 
It can be proved that there is no other set of numbers, except a 
constant set, that goes in a circle apart from the above five. There 
are five and only five principal components (and one constant one) 


TABLE 4 


PrincipaL CoMPoONENT SCORES FOR THE SCALE OF TABLE 1 


Type of Principal component score 
rere: I mq om ov Vs constant 
5 5 5 5 1 1 1 
4 3 -1 -7 -—3 -6 1 
3 1 —4 ~-4 2 10 1 
2 -1 —4 4 2 —10 1 
1 —3 -1 7 —3 5 1 
0 —5 5 -5 1 -i1 1 
Size of circle 6 2 al .06 .04 1 
Sum of squares 70 84 180 28 252 6 


for a universe of five kinds of dichotomous items. In general, for 
dichotomous items, the number of nonconstant principal compo- 
nents is equal to the number of types of items. 

A constant set of numbers always goes in a circle. Such a set of 
constants is always proportional to the set of one’s: 1, 1, 1, 1, 1, 1. 
Using these in Table 1 yields weights all equal to unity, which in 
turn give back “circular” scores of unity, so that the size of circle 
is itself unity. However, the size of circle here is not the square of 
a correlation ratio, since the mean of its scores is not zero. All non- 
constant principal component scores have zero as their mean. The 
constant component plays a role analogous to that of the addi- 
tive constant in a regression equation, and is actually needed later 
on for our reproducing process to work out exactly. Thus, while 
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the size of circle for the constant component is the largest possible, 
it is not to be interpreted as a correlation ratio squared, but we do 
have to take it into account for completeness later on. 

The oscillatory nature of the principal component scores of a scale. 
Let us bring together the principal component scores of Table 1 into 
Table 4. The components are labeled with Roman numerals ex- 
cept for the constant one. Column I has the metric component 
scores ; column II has the second component (or what is now identi- 
fied as intensity) scores, etc. 


A C third component 
first 
component 


D fourth component 


B second 
component 


5 ° 1 2 3 4 § 
TYPE OF PERSON TYPE OF PERSON 


(RANK ORDER) |. (RANK ORDER) 


Figure 1. The principal components of the scale of Table 1. 
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For the perfect scale, such component scores have a definite rela- 
tion to the rank order of the people. This relationship is shown 
graphically in Figure 1 for each component separately.° 

For this example of uniform frequencies for types of persons and 
for types of items, component I is a perfect straight line function of 
the rank order (Figure la). Hence, rank order is essentially the 
intrinsic metric for this kind of scalable universe. 

The second component is a U-shaped function of the rank order 
for this example (Figure 1b). In fact, the points all lie on a perfect 
parabola here. The second component is identified psychologically 
as the intensity component. Since the curve is symmetrical about 
the middle rank, this indicates that the zero point is between ranks 
2 and 3; that is, the zero point separates the component I scores 
~landi1. This is as it should be; the zero point separates posi- 
tives from negatives on the metric component. If we had an infi- 
nite number of types of items instead of just five as in the present 
example, and if the frequencies remained uniform for both items 
and people, it can be proved that the infinite number of points for 
the second component would form a perfectly smooth parabola 
whose lowest point will be precisely at the zero of the metric scores. 
The discrete jumps in Figure 1 are due to the fact that we have only 
a finite number of types in this example. 

The curve for the third component (Figure 1c) has one more bend 
than that for the second component. For the universe of uniform 
frequencies like our example, it can be proved that the curve is al- 
ways that of a third degree polynomial. (A parabola represents a 
polynomial of the second degree.) It could serve to divide the 
population into three groups, according to its turning points. Per- 
haps in the future there will be found a psychological interpretation 
of these bending points, just as the bending point of the parabola 


* The scores in Table 4 are, for simplicity, deliberately expressed in the form of whole 
numbers, even though this forces the standard deviations to be arbitrarily unequal. It 
should be remembered that principal components are determined only up to a constant 
of proportionality: any multiple of a principal component is also a principal component 
that goes in the same circle and so is not regarded as a new component. This is like 
changing feet to inches or to centimeters, or pounds to ounces or to grams. The stand- 
ard deviation changes in direct ratio to the multiplier. It is convenient to use multi- 
pliers that will make the standard deviations of the components proportional to their 
respective correlation ratios. Thus, the metric component—which has the largest size 
circle—will have the largest standard deviation, while component V—which has the 
smallest size circle—will have the smallest standard deviation. The vertical scales in 
Figure 1 have been chosen to provide the multipliers needed to make the standard devi- 
ations proportional to the respective correlation ratios; the relative sizes of circle are 
apparent graphically from the average vertical distances of the plotted points from the 
zero line. 
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has been identified with the zero point of the attitude or opinion.” 

Similarly, the fourth and fifth components have curves with three 
and four bends in them respectively (Figures 1d and le). For our 
example, these represent fourth and fifth degree polynomials. 
Again, these may be found in the future to provide psychologically 
meaningful cutting points in the rank order of the people. 

It can be proved that there are as many nontrivial principal com- 
ponents for a universe of dichotomies as there are different types of 
items. Since our example has five types of items, it also has five 
nontrivial components. Of necessity, there can be at most six dis- 
tinct metric scores, one for each scale type of person. In order to 
obtain a continuous metric, there must be an infinite number of 
types of items and thus also an infinite number of types of people. 
Then there will also be an infinite number of principal components. 

It can be proved that the oscillatory nature of the principal com- 
ponents, as illustrated in Figure 1, also holds if the number of types 
of items is infinite. The infinite case, with a continuous metric in- 
stead of discrete points, is the one usually hypothesized in practice. 
For this case, if the frequencies remain uniform both for types of 
items and types of people, then the first, or metric, component will 
be a continuous parabolic function. The third component will be 
expressible as a continuous polynomial of the third degree, etc., etc. 
Only now there will also be an infinite number of principal com- 
ponents instead of just five asin Figure 1. Each will have one more 
oscillation in it than the preceding one; for the kind of universe dis- 
tribution we have hypothesized, each will be expressible precisely 
as a polynomial of one degree higher than the preceding one.?! 

More generally, the oscillatory nature of the principal components 
of a scale holds no matter what the distributions of types of items and 
types of persons are. Some types of persons can be more numerous 
than other types, and/or some types of items can be more numerous 
than other types of items. The rank scores of people can have a 
binomial, Poisson, or any other distribution whatsoever as well as 
the uniform distribution we have used for illustration, and the types 
of items can also have any arbitrary distribution. Regardless, it 
will be true that the first component scores will have exactly the 
same rank order as the rank scores; they will not in general be a 


2° See footnote 2 above added in proof. 

4 The sizes of circles become modified as the number of items changes. In the infinite 
case, the largest size circle (for the metric component) is .5 for our example; the next 
largest is .15, etc., etc. The sizes keep decreasing rapidly for the smaller components, 
but the sum of the infinite series of sizes remains unity, representing the total variation. 
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straight line function of the rank scores (so that rank scores will not 
provide the intrinsic metric) but they will be monotonically in- 
creasing functions of the rank scores. The first component scores 
will represent the various stretchings and contractions of distances 
between ranks needed to provide the intrinsic metric. 

The second, or intensity component, will always be U- or 
J-shaped. It will not in general be a parabola; it may be part of a 
sine or cosine curve, or anything else that has one bend in it. It 
need not be symmetrical about the zero point. If it is symmetrical, 
that means half the population is positive and half is negative. If 
it is not symmetrical, then the population is not split 50-50 on the 
given issue; the majority favors one side. 

Similarly, the third component will have two bends in it, even 
though the equation for it may not be that of a polynomial; the 
fourth component will have three bends; and so forth. 

Components and the sampling of items. The various bending 
points of the component curves are determined by the respective 
frequencies of the types of people and types of items. An impor- 
tant feature is that the bending points are essentially invariant with 
respect to the sampling of items (and of people too) ; if rank order 
on a higher component is plotted against scale rank, the bending 
points will appear at the proper scale ranks. This we have already 
observed for the second, or intensity, component in Chapter 7. 
But the same holds true for the remaining components. If a sepa- 
rate measure of the third component could be obtained” on a sample 
of items, like the separate measure we got for intensity in Chapter 7, 
then plotting the sample ranks on this measure against sample rank 
on content will yield a curve with two bends in it (instead of the one 
bend in the intensity curve), and the bends will always occur at the 
same content percentiles, regardless of the sample of items used. 
These two bending points of the third component are invariant with 
respect to samphng of items just like the zero point of the intensity 
curve. A similar invariance holds for the bending points of the 
fourth and higher components. 

The psychological meaning of the components beyond intensity 
has not yet been determined, but it seems plausible that each bend- 
ing point should indicate a psychologically meaningful dividing 
point for the population, just as the zero point of intensity does. 
When the psychological meaning is discovered for a component, it 


12 See footnote 2 above added in proof. 
" See footnote 2 above added in proof. 
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may be possible then to measure it approximately and independ- 
ently, as we have intensity. 

Because of the invariance of the bending points with respect to 
sampling of items, they provide a possible solution to the problem 
of making inferences about the population distribution of scale types 
and the universe distribution of types of items. We have noted 
before that these two distributions can be shown by independent 
experiment to determine the shapes of the component curves (this 
has been done approximately for the intensity curve). Thus the 
bending points will provide information about the distributions; 
such information will not depend on the random sampling of items. 
The fact that sample items are not random will not prevent valid 
inferences from them concerning their universe, because of the 
invariant property of the sample components. 

“Scores” and “weights.” Thus far, we have concentrated pri- 
marily on quantifying the rank order of people. Let us now turn 
to the quantifying of category values of items. This is a problem 
distinct from that of people, but it turns out to be closely related. 
Either of these two problems can be solved separately without refer- 
ence to the other, but it is convenient for our purposes to present 
their solutions in a manner which utilizes their interrelationships. 

Just as there are values for people that “go in a circle,” so there 
are values for categories that go in a circle. For convenience, we 
have been calling values for people scores here, and values for cate- 
gories weights, in order to distinguish the two quantification prob- 
lems. It should not be inferred, however, that ‘‘weights’’ are in- 
tended a priori only to be added up to provide “‘scores,’’ any more 
than ‘“‘scores’’ are intended a priori to be added up to provide 
“weights.” It so happens that the equations of internal consist- 
ency prove that there is a reciprocal relation between principal com- 
ponents for people and principal components for categories that does 
involve an additive (more precisely, an averaging) process, 80 we are 
justified in part in retaining the traditional nomenclature of “‘scores”’ 
and “weights.’”” From the point of view of quantifying people’s 
ranks, this terminology is quite appropriate. From the point of 
view of quantifying category values, however, the terminology 
should really be reversed; values for people are weights which can be 
used to determine values (scores) for categories. It is convenient 
not to keep switching terminology, so we shall retain the traditional 
terms even in the problem of category values. 

Example of principal component weights. For each principal com- 
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ponent of the scores, there is a principal component of the weights 
with the same correlation ratio and (hence) size of circle. The first 
principal component weights have already been exhibited in Table 
2. If these weights were now used to obtain scores, and the new, 
“circular,” weights were obtained from the scores, the “circular” 
weights would be found to be precisely .6 times the original weights. 

If component weights are known, then component scores can be 
derived from them by the averaging process. For example, if we 
did not know the first component scores in Table 4 but did know 
the first component weights of Table 2, then we could use these 
weights to obtain the desired scores. Similarly, if the scores are 
known first, they can be averaged to yield the weights. In our ex- 
ample, we have been given the solution for all the component scores 
(Table 4). Therefore, we can now easily derive all the component 
weights from these scores. 

The second principal component weights are derived from column 
II of Table 4, just as the first component weights of Table 2 were 
derived from the scores in column I of Table 4. The remaining 
component weights are similarly obtainable from the remaining 
columns of Table 4. There is again a constant principal component, 
represented by weights all equal to unity, corresponding to the con- 
stant score component. The principal component weights, as ob- 
tained from Table 4, are all listed in Table 5. 

Like the scores, principal component weights are determined only 
up to a constant of proportionality. Any column of weights in 
Table 5 could be multiplied through by any constant whatsoever, 
and the resulting weights will also go in a circle, but it will remain 
the same circle as before. 

For the perfect scale pattern, each of the remaining component 
weights has a simple pattern of relationship with the rank of the 
first, or metric, component weights, analogous to score component 
relationships. There is a hop in the metric component weights be- 
tween categories 5a and 1b, where the distance is 2 (the difference 
between 1 and —1), or twice the distance between any other two 
neighboring categories. It is convenient, therefore, to consider 
category 1b as being two ranks below 5a, rather than just one rank. 
In fact, we could consider the intervening rank to be filled by a cate- 
gory of a sixth type of question, namely, one that has all the people 
in it, for which the weight is zero. Then the metric component will 
be a perfect straight line function of the category ranks for this ex- 
ample. 
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The second component is a U-shaped function of the category 
ranks for this example. However, unlike the second component 
scores, the equation for the U is not that of a parabola, but is a more 
complicated expression. Each of the succeeding component weights 
exhibits more oscillation than its predecessor in its relationship to the 
category ranks (the equations for these oscillatory curves differ from 
those for scores). 

Principal components correlate zero with each other. The principal 
components of a scale present an interesting and important paradox. 


TABLE 5 


PrincipaL Componmant WxieuTs FOR THE ScaLu or TABLE 1 


Type of Principal component weight Fr 
category I mu em av Vs Constant. «OF Te#Ponse 
la 5 5 5 1 1 1 1 
2a 4 2 -1 -1 —-2 1 2 
3a 3 0 -2 0 2 1 3 
4a 2 -1 —0.5 0.5 -1 1 4 
5a 1 -1 1 —0.2 0.2 1 5 
lb -1 -1 -1 —0.2 —0.2 1 6 
2b —2 -1 0.5 0.5 1 1 4 
3b -3 0 2 0 —2 1 3 
4b -4 2 1 -1 2 1 2 
5b -5 5 -5 1 -1 1 1 
Size of circle 6 2 1 .06 04 1 
Total frequency 30 


Although the principal component scores are all perfect functions of 
the scale ranks, nevertheless they have perfect zero linear correlations 
with each other. Let us verify this for the scores in Table 4. 

To show that a correlation coefficient is zero, it is sufficient merely 
to show that its numerator is zero. Since the principal component 
scores have zero means, the numerator of the product-moment cor- 
relation coefficient can be obtained by taking merely the products of 
the scores as they stand. For example, the numerator for the linear 
correlation between component scores I and II is computed as fol- 
lows. The first type of person has a score of 5 on both of these com- 
ponents, so the product of his scores is 25. A person of type 4 has a 
score of 3 on the first component and —1 on the second component, 
so his product is —3. The products for the remaining types are 
similarly computed. The product sum, or numerator for the linear 
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correlation coefficient, between components I and II is found to be 
precisely zero: 


(5)(5) + (8)(—1) + G)(—4) + (-1)(-4) + 
(—3)(—1) + (—5)() = 0 


This means that the correlation coefficient itself is zero. Similarly, 
correlations between any two columns of scores in Table 4 can be 
verified to be zero. Thus we have the paradox of uncorrelated but 
dependent sets of scores. 

Similarly, it can be verified that the principal component weights 
of Table 5 have zero linear correlations. Here we have to take into 
account the fact that the categories have different frequencies of 
response. To calculate the numerator of the correlation coefficient 
between weights of the first and second components, we proceed as 
follows. Category la has a weight of 5 on both components and a 
frequency of 1, so its product is (5)(5)(1) = 25. Category 2a has 
a weight of 4 in the first component and a weight of 2 in the second 
component, with a frequency of 2, so its product is (4)(2)(2) = 16. 
The products for the remaining categories are computed in a similar 
fashion, and the sum of the products is precisely zero: 


(5)(5)(1) + (4)(2)(2) + (3))() + (2)(—1)4) + @)(—-D) + 
(-1)(-1)6) + (—2)(-1)@) + (—3)(0)@B) + 
(—4)(2)(2) + (—5)(5)() = 0 


In a similar fashion the correlation between any two columns in 
Table 5 can be seen to be zero. 

It is this property of zero linear correlations but perfect curvi- 
linear correlations that will throw off any attempt at factor analysis 
in the Spearman-Thurstone sense. The Spearman-Thurstone ap- 
proach was devised for a different kind of data; it is not equipped to 
handle configurations of the kind involved in scalogram analysis. 

It is of great psychological importance that we find many linear 
dimensions in a scale, even though there is but a single factor. The 
response of each individual to each item can be viewed in either of 
two ways. The first way is the one thus far used in the preceding 
chapters: the responses are a simple function of the rank order of 
the people. The second way of looking at the matter is the one 
that has now arisen out of our quest for an internal metric: the re- 
sponses to an item are expressible as functions of metric scores, 
intensity scores, and scores on principal components of higher order. 
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The responses have been resolved into many linear components, 
even though they are functions of but a single factor. Each item 
involves the same single factor as any other item, but differs in 
how it combines the components. Each type of item represents a 
different variation on the same theme. Another way of expressing 
this is that each type of item represents a different combination of 
the tones and overtones of the same basic note. 

Different scalable attitudes will have different metrics, reflecting 
the differences in the shapes of the several component curves. Dif- 
ferent intensity structures of attitudes imply different metrics, as 
will different structures of lower order components. 

Thus, we have further evidence of the appropriateness of scalo- 
gram analysis to the study of attitudes and opinions in that it shows 
the deep relationship that exists between unidimensionality and the 
concomitant multiplicity of psychological components. 

Reproducing responses from principal components. Thus far, we 
have shown how the principal components are involved in and deriv- 
able from the item responses. Now we shall show how the item re- 
sponses can be reproduced from the principal components. 

Just as responses of individuals to items can be reproduced from 
the fact that they are simple functions of the scale ranks, so they can 
be reproduced from knowledge of their principal components. The 
procedure for doing the reproducing is different for these two ways 
of viewing the matter. Using the notion of simple function, we 
establish cutting points on the rank order for each item and thereby 
determine whether or not a person falls in a given category. To 
reproduce the responses from the principal components, a simple 
numerical scheme holds. We can reproduce the responses of Table 
1 by means of product sums of principal component weights and 
scores. 

The numerical procedure can be illustrated as follows. Suppose 
we want to know whether a person of type 5 falls in category 2a. 
We know the five component scores for such a person from the first 
row of Table 4: 5, 5, 5,1, and 1. He also has a constant score of 1. 
From Table 5, we know the component weights for category 2a: 
4,2, —1, —1, —2, and the constant weight of 1. Now we have to 
remember that the scores and weights are determined only up to a 
constant of proportionality ; they must be reduced to a standardized 
form for the purpose of reproduction. The standardizing is done 
with the aid of the sum of squares of the columns in Table 4, which 
are indicated in the last row of Table 4. A person of type 5 has a 
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score of 5 on component I, while category 2a has a weight of 4 on 
this component, and the sum of squares is 70. The standardized 
product of score and weight for the first component then is: 
(5)(4)/70. The standardized product of score and weight for 
the second component is: (5)(2)/84, and so on for the remaining 
three components, and also for the constant component. These 
six standardized products are added up and then multiplied by 
the frequency of response of the category, which in this case is 2 
(from the last column of Table 5 or the last row of Table 1). The 
final result turns out to be exactly unity: 


(5)(4) , (5)(2) , ()(—-1) , @(—-)) ,, )(-2) , MM] _ 
2[ 2M , 2) 5 190 + 28 + oa 7+ i) a 


The result will always be unity if the person falls in the category, 
and it will be zero if the person does not fall in the category. For 
each x in Table 1 we would obtain unity as the outcome of the 
standardized sum of the products, and where there is no x we 
would obtain zero. 

As an example of obtaining a zero, let us inquire if a person of 
type 5 is in category 3b. This category has a frequency of re- 
sponse of 3, and if we multiply this by the sum of standardized 
products we obtain zero: 


(6-3) , (5)0) , 2) _, )) , @(-2) , May] _ 
3 OCD + BO) , OO + OO , ED 4 a = 


Hence, a person of type 5 does not fall in category 3b; there actu- 
ally is no x in Table 1 for that position. 

We could test every category for a person of type 5 and we would 
find the standardized product sums to yield unity or zero in the ap- 
propriate places, and similarly for persons of type 4 and the other 
types. The pattern of 1’s and 0’s obtained from the principal com- 
ponents is shown in Table 6. This is identical in form with Table 1. 
The principal components reproduce the original responses exactly, 
just as do cutting points on the scale rank order. 

The distinction between scalability and nonscalability. Every pat- 
tern of responses, whether scalable or not, can be resolved into prin- 
cipal components, and the responses can be reproduced exactly from 
all the components. The distinction between scalable and non- 
scalable configurations lies in the sfructure of the principal com- 
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ponents. Just as it is true for nonscalable data that they cannot be 
reproduced perfectly from cutting points on any rank order of 
people, so it is true that the principal components are not perfect 
functions of any rank order. For a scalable configuration, we 
have observed the neat relationships among the hierarchy of com- 
ponents. All the components are perfect (curvilinear) functions 
of the rank order (and of the first or metric component) ; and they 
have a definite law of formation as to their oscillations. This is not 
at all true for nonscalable data. There is no perfect curvilinear 
relationship between the components. Indeed, they are always 


TABLE 6 


REsponsns OF TABLE 1 REPRODUCED FROM THE PRINCIPAL COMPONENTS 


Type of Type of category 

person ja 2 Sa fa 65a 1b 2b) 8Besié«dSti(ité«i“D 
5 1 1 1 1 1 0 0 0 1) 0 
4 0 1 1 1 1 1 0 0 0 0 
3 0 0 1 1 1 1 1 0 0 0 
2 0 0 0 1 1 1 1 1 0 0 
1 0 0 0 0 1 1 1 1 1 0 
0 0 0 0 0 0 1 1 1 1 1 


uncorrelated just as in the scalable case, but furthermore they have 
no simple pattern of dependence and can actually be completely 
independent. In general, there is no such thing as a U-shaped 
second component which might be identified as intensity, nor a 
third component which has two bends, etc., etc. From the first 
component one cannot know the other components in the case of 
nonscalable data. 

In the scalable case there seems to be hope that the principal com- 
ponents can be identified experimentally, as we have done with the 
intensity function. This might serve as a solution to the problem 
of sampling of items. In the nonscalable case there seems to be no 
solution as yet in sight for the problem of sampling items. There 
is as yet no framework of inference available concerning the rela- 
tionship between a sample of items and the universe of items for 
the nonscalable case, nor any clues as to how to identify or inter- 
pret the components. This again is but a reflection of the fact that 
there is no single meaningful rank order for the general nonscalable 
case. 
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The Mathematical Analysis 


The parallelogram for dichotomies. Let us consider a scalable 
universe of m types of dichotomous items, so that there are m + 1 
types of persons. The types of persons will be numbered in their 
scale order from zero to m. There are 2m types of categories (two 
for each type of item); these will be numbered according to their 
scale order from one to 2m. Then categories j and m + j (jS m) 
belong to the same dichotomy, or we shall say they comprise the 
jth type of dichotomy. 


TABLE 7 


Mararx or a ScaLE oF m Typses or DicHoromixs 


Type Type of category Frequency Frequency 


of of of person 
person 1 2 S3-:>-mm+1m+2 m+8---2m_ response type 
0 1 1 1---] 0 0 0 0 n to 
1 0 1 1-1 1 0 0 0 n n 
2 0 0 1::-1 1 1 0 0 n he 
ar nae ea eee Deana ate one oe p 5 ae 
™m 0 0 0 0 1 1 1 n ta 
Frequency 
of 
response F, Fy, Ps-+++ Fr Fai Fry? Fay. ++: Fam 
Frequency 
of 
category 


types Mi Q2 Qs*'' gm m+1 Gm+a Gm+3'** Jam 


The response of each type of person to each type of item will be 
represented compactly as follows. Let 


1 if persons of type z 
are in category type j, 


0 otherwise, 


The scale matrix defined by (1) is exhibited by the ones and zeros 
in the body of Table 7. 

The parallelogram pattern of ones and zeros can be stated in 
more analytical terms: 
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1 iflsj—isgm, 


0 otherwise, 


It follows that, if e;; = 1, then e; m+; = 0, and conversely, so that: 
;=0,1,-:: 
@) cotemesmt GTQ") 


This states that if a person is in category j, he cannot be simul- 
taneously in category m + 7 (which is the opposite category of the 
same item) ; conversely, if he is in category m + j, he cannot be in 
category j; he must be in one and only one of these two categories. 

Frequencies of types and of responses. The parallelogram pattern 
shows how types of persons behave with respect to types of cate- 
gories. We have also to consider the frequency distributions of 
these two kinds of types. Let f; be the number of persons of the 
ith type (i = 0,1, --- , m), and let N be the total number of people 
in the population. Then 


“ v= Ds 


Let g; be the number of categories of the jth type. The f; and g; 
are indicated in the outer margins of Table 7. Since categories 
belonging to the same type of item occur equally often in the uni- 
verse, we have 


(5) G5 = Ym+i (j = 1,2, ---, m) 


Let n be the total number of items in the universe. Then 


(6) n= ». 9 


Next, we consider the frequencies of responses. Since each person 
is in one and only one category of each item, and since there are n 
items, each individual has n responses. Analytically, this is ex- 
pressed as follows. If e,; = 1, then a person of type 7 has g; re- 
sponses of the category j type. Therefore, his total number of 
responses is obtained by adding up those g; for which e;; = 1 for 
him. If we call this sum G,, then 
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am bad - 
(7) Gz= dae = Ye ges + ten stems 
(i = 0, 1, s+, m) 


From (5), gm + ; can be replaced by g; in the second term on the 
right. Then using (3) and (6) reduces the whole right member of 
(7) to n, so that we have the identity: 


8) n= Si pes @=0,1,--+,m) 


The frequency of response for categories, however, is not con- 
stant like that for people. Each type of category can have a dif- 
ferent number of people in it from any other type of category. Let 
F; be the number of people in—or the frequency of response to—a 
category of the jth type. Then 


@) Free Dy fees G12, +++, 2m) 


The two categories of an item must contain all the N people, be- 
cause each person is in one of the categories of each dichotomy. 
This is seen analytically by the fact that, from (9), 


(10) Fass Dy feimes G12 ++) 


so that adding right members of (9) and (10) and then using (3) 
and (4) yields the identity : 


(11) Fy + Fas3=N, (j = 1,2, --+,m) 


The frequencies of response are shown in the inner margins of 
Table 7. 

The correlation ratios. The two problems of quantification, that 
of finding the most internally consistent numerical values for people 
and the similar problem for categories, can be posed and solved sep- 
arately. Numerical values for categories will be called “weights” 
to distinguish them from the numerical values for persons which are 
called ‘“‘scores.” No implication of a “weighing” or a “weighting” 
process is intended here a priori. 

The basic equations for these two problems have been developed 
previously, for the case more general than the scale pattern. For 


% Guttman, loc. cit. 
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the special case of a scale, we obtain very striking and important 
results. The basic equations will now be repeated here, in the no- 
tation of the present chapter, so that the discussion here will be 
relatively self-contained. 

For the problem of scores, we have to consider the variation within 
columns of Table 7. For a given set of x;, the mean of scores for the 
jth column or type of category is a;, where 


1 bg ‘ 
a; = Fan nie (j = 1, 2, ---, 2m) 


The sum of squares of deviations from the mean for the jth column 
is 8;, where 


bd 
8; = bi 7 9 — a;)*fiess (j = 1,2, -:-, 2m) 


The total sum of squares over all columns, or the sum of the varia- 
tion within all categories, is S., where 


am 
S; = >. 293 


We wish to find the z; which will minimize S, as compared to the 
total variation about the general mean. 
The general mean of the z; can be denoted by A, where 


The total sum of squares of the deviations of the x; from A, sum- 
ming first over 7 and then over j, is seen to be J'., where 


T.=n}) (t— AY: 


We wish to minimize the ratio S./T.. Now the complement of 
this ratio is the square of the correlation ratio for scores for Table 7, 
denoted by nz. The problem is thus equivalent to maximizing 7,, 
where 


a =l- S./T. 
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Therefore, in expanded notation, for the problem of scores, the 
best for reproducing e¢,; is that set of values z,; that will maximize 
the correlation ratio 7. defined by 


> CEE) ENE 


t=-0 a 


(12) ng =1- 


The maximizing scores satisfy the stationary equations obtained by 
differentiating (12) with respect to the x; and setting the derivatives 
equal to zero: 


(13) Die Inna = ner; (i = 0, 1, aaa | m) 


a 


where || A, || is the Gramian matrix defined by 


am a 
(14) 7 (h, i = 0,1, «++, m) 


i 


Similarly, the problem of weights is concerned with minimizing 
variation within rows of Table 7. The correlation ratio for rows is 
determined as follows. For a given set of weights, y;, the mean for 
the ith row of Table 7 is b;, where 


1 2m ae 
b; = n >. 2 Yedeeik (i = 0, 1, , m) 


The sum of squares of deviations from the mean for the ith row is 
r;, Where 


2m 
r= pa _ AC — b;)*g;e; (i = 0,1, ---, m) 


The total sum of squares within rows, that is, within people, is then 


R,, where 
a= Do 


The general mean for the weights is B, where 


1 am 
Bs oe Bele 
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and the total variation around B for all rows is U,, where 
2m 
i et 


The ratio to be minimized is R,/U,, or the correlation ratio to be 
maximized is yy, where 


ny = 1 — R,/U,y 


Therefore, for the problem of weights, in expanded notation, the 
best ones are the set of values y; that will maximize the correlation 
ratio n, defined by 


baad 2m 1 am ; 


(15) Ny am 2: 
1 mm 
> _ Wim apa _ ver F; 


for which the stationary equations are 

2m 
Da _ Y9iBa = W’Fiye = (k = 1,2, ---, 2m) 
where || B;: || is the Gramian matrix defined by 


(17) Bu = ‘)y ZS o fitistik (j, k= 1, 2, ae 2m) 

The equations of internal consistency. As has been shown else- 
where, the optimum scores satisfying (13) and the optimum 
weights satisfying (16) are related. They together also solve a 
third quantification problem that can be posed: find scores and 
weights that are most consistent with each other. This requires 
finding the values z; and y; simultaneously that will maximize the 
correlation coefficient p defined as 


(18) p= 


YY - Faw - LU prosacs 
[eden wen] [QU ne De FOr 


4 Ibid. 
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The stationary equations are: 


2m 
(19) > = Piste = nx; (¢ = 0,1, --+,m) 
(20) ys rihiess = byl; (j = 1, 2, ---, 2m) 
where 


Equations (19) and (20) have been called the equations of internal 
consistency. They are equivalent to both (13) and (16) ; using (20) 
in (19) to eliminate the y; yields (13), and using (19) in (20) to 
eliminate the 2; yields (16). The equations of internal consistency 
state that, for stationary values, the score of each person is propor- 
tional to the mean of the weights of the categories he falls in, and 
the weight of each category is proportional to the mean of the scores 
of the persons who fall in that category. 

Latent roots and vectors. Because of the equivalence of the equa- 
tions, a solution of (19) and (20) with correlation coefficient p must 
yield correlation ratios for (13) and (16) such that 


(22) P= 72 = 77 


and this must be true for each solution. 

The equations have more than one solution. Equations (13) 
and (16) can be regarded as the respective characteristic equations 
of the matrices || f,A,,; || and || g;By./F: ||. The 2; are the elements 
of a latent vector of the first matrix, corresponding to the latent root 
nz*; and the y; are the elements of a latent vector of the second 
matrix, corresponding to the latent root 7,7. The ranks of the two 
matrices are the same; they have the same number of nonzero 
latent roots—all of which are posttive—and these roots satisfy (22). 

There are always the constant latent vectors z; = y; = 1, for 
which the latent roots are unity; but these latent roots are not 
actually squares of correlation ratios. All nonconstant latent vec- 
tors have roots less than unity (these roots really being squares of 
correlation ratios), and these vectors satisfy the respective condi- 
tions 


(23) yy fa = 0 
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(24) Fyj + Fa+ nes = 0, (j = 1, 2, ---, m) 


It is the largest root of the nonconstant latent vectors that is the 
maximum p? (or 7,” or 7,*), and the respective corresponding latent 
vectors—which will be proved to be unique in general for our scale 
pattern—are the optimum scores and weights, or what we call here 
the metric components. 

For further discussion of the solutions of the matric equations for 
an arbitrary pattern of responses, scalable or not, the reader is re- 
ferred to the original paper.'* We continue now with the special 
form the solution takes for the dichotomous scale pattern. In this 
special case, the matric equations transform into an important— 
and apparently new—general type of difference equations. In this 
case, the remaining latent vectors are of considerable interest, as 
well as the metric components. 

The first differences. The matric equations for the dichotomous 
scale transform into a second-order linear difference equation that 
does not seem to have appeared before in its general form, and that 
is of some general mathematical importance apart from scale anal- 
ysis. Both equations (13) for scores and (16) for weights yield this 
form of equation, differing in the coefficients and boundary condi- 
tions. It is perhaps easier to work with (19) and (20), which are 
equivalent to (13) and (16), and that is where we shall begin the 
differencing. 

Taking the first difference of both members of (19) with respect 
to 7 yields 


300 . 
(25) And = YO" yas = 0,1,---,m—1) 


From (2), 
1 jritm+tl, 
om toeta ieee GIR) 
0 _ otherwise, J 
Using (26), (25) becomes 
(27) AnAgi = Yig we Jigme — Yrs ies (i =0,1,---,m—1) 


or, remembering (5), 


u [bid. 
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(28) AnAws = 954 1Yia met — Yeo) @ =0,1,--:,m—1) 


But, from (24), replacing j by 7 + 1, 
(29) Yiomar = —Fya yg (Pig mg ( =0,1,-::,m— 1) 
Using (29) in (28), and remembering (11), we obtain 


(30) Ande, = SNOtry, ( = 0,1, ---,m—1) 


t+m+1 


Next we operate on the y;. Taking the first difference of both 
members of (20) with respect to j yields 


31) wARa= YO” afd = 1,2, ++, 2m- 1) 


From (2), 
1 tj : 
ae 7=0,1,---,m 
ee) Anu me ae Gs 1,2, ee 
0 otherwise, 


Using this result in (31), we find 


Jit; (lsjsgm-1) 
(33) warn \freninn (m+1S 7 8 2m — 1) 
Smtm — foto (j = m) 


The difference equation for scores. For the second differences it is 
convenient to introduce the following notation. Let 


(34) oy = Team (i = 0,1,---,m—1) 
t+ 

and let 

(35) o = N/(ndu) 


Then (30) can be written as 
(36) cAw; = — OUP s+ i+ @ = 0,1,---,m— 1) 


Taking the first difference of both members of (36) with respect to 7 
yields 
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(87) Ax(eAz,) = —OuAFi4yi+. (= 0,1,---,m— 2) 


whence, using (33), we have the desired second-order equation in 
the 2; alone: 


(38) As(cAs) + Ofi4:%141=0, (= 0,1,---,m— 2) 


The boundary conditions for (38) follow from the scale pattern. 
According to (2), 


1, (i = 0) 
(39) ea = 
0, (lsStsm) 


that is, the first category column of Table 7 has unity for a person 
of the lowest type only. Therefore, from (39) and (20), 


(40) By: = foro 
Using (40) in (36), we obtain a lower boundary condition: 
(41) coAcro + dforo = 0 


Similarly, to obtain an upper boundary condition, we notice that 
from (2), 


1, («= m) 
(42) Ci.om = 


that is, the last category column of Table 7 has unity for a person 
of the highest type only. Using (42) in (20) yields 


(43) BY sal 2m = fntm 


Using (24) and (43) in (36), we obtain an upper boundary condi- 
tion : 


(44) Cm — An — itm —1 — Ofntn = 0 


The difference equation for weights. An equation of precisely the 
same form as (38) holds for weights, with boundary conditions of a 
rather different form from (41) and (44). These are obtainable 
from (33). We need concern ourselves only with the first m weights, 
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for the remaining weights are then easily obtainable by virtue of 
(24). 
According to (33) and the notation (35), 


(45) of, Ait is = Anz; G=1,2,-°,m— 1) 


N 

fj 
Taking the first difference of each member of (45) with respect to j, 
and then using (30) and (34), yields the desired difference equation 
in the y; alone: 


(48) a, (7 ar, ' +2 Fiecusr= 0, G=1R-ym—2) 


This is of the same form as (38), with the F,y;, 1/f;, and 1/c; play- 
ing the roles of the z,, c;, and f; 4 1 respectively. 

The boundary conditions for (46), however, have a different form 
from those of (38). The lower boundary condition is obtained by 
solving (41) for x; in terms of 2, and then in terms of y; by means 
of (40): 


(47) ipo BF yi(co = fo) 
cafo 


Then using (47) in (33), we arrive at the boundary condition 


i 


oe) fi 


AiFiys + (é - 7) Fy, = 0 
For the upper boundary condition, we solve (44) for z,,—1 in terms 
of tm, and then we use (43) and (24) to obtain 


(49) tm 1 = Een fn — Om — 1) 


Cu — hm 
Using (49) in (33) yields the boundary condition 
ae 
f m—il 


Conditions (48) and (50) differ in form from (41) and (44) in that 
the coefficients of the last terms have an additive constant. 

Matric inversion by differencing. The difference equation (38) 
and its boundary conditions (41) and (44) are equivalent to the 


me Fan = 0 
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matric equation (13). Any solution of (13) necessarily satisfies 
(38), provided only that it also satisfies (24); this follows from the 
derivation of (28). Only the constant solution does not satisfy 
(24) ; it is easily verified, however, that it in fact does satisfy (38), 
only that now 1/dy equals zero instead of unity for the solution. 

The converse, that every solution of (38) also satisfies (13), can 
be seen by regarding the difference equations as a matric equation. 
Equations (41), (88), and (44) can be written in the respective ex- 
panded forms: 


(51) Lolo — LiCo = foro 


(52) —aaeg + 24 ales + ee) — Seg atsg = fey hs 41 
(@ = 0,1, ---,m — 2) 


(53) Im — 10m —1 + ImCm —-1 = Of min 


Let Cy; (h, 7 = 0, 1,---,m) be the elements of the symmetric 
matrix C defined as 


(54) C= 


—Cm-2 Cm2+Cm-1 —Cm—1 
—Cm-—1 Cm — 1 


Then equations (51), (52), and (53) can be written as the single 
matric equation 


(55) pine EaCns = Of iz; (@ = 0, 1, ---, m) 


In order to help compare (55) with (13), it is convenient to re- 
state both these equations as follows. Let 


(56) & = av fi (¢ = 0,1, ---, m) 
(57) Anus = AnV fil (h,t = 0,1, ---, m) 
(58) Gy = 2G (h, i = 0,1, +++, m) 
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and let 


(59) #= (lal, A= lAadl,  ¢ = |lCull 
Also, according to (35), (21) and (22), we can set 
(60) $ = N/nr* 


The subscript of the correlation ratio can be dropped here and 
henceforth. 
From equations (56) through (60), (13) becomes 


(61) 2A = nf 
and (55) becomes 
(62) 20 = ae 


If none of the g; vanish, it is seen from Table 7 that all scale types 
of persons must occur so that none of the f; vanish. Then, accord- 
ing to (14) and (57), A is nonsingular and Gramian. Hence, there 
are exactly m + 1 independent vector solutions # to (61), corre- 
sponding to m + 1 positive latent roots 7%. Every solution of (61) 
satisfies (62), except for the constant solution. From its order, C 
has at most m + 1 independent latent vectors; m of these are also 
latent vectors of A, but its latent roots are the reciprocals of the 
corresponding ones of A. The constant vector now has the ele- 
ments Z,; = VF; it has the root n? = 1 in (61) and the root 1/n? = 0 
in (62). Hence, every vector solution of (62) also satisfies (61), 
allowing for the difference in latent roots for the constant vector. 

It is well known that the inverse of a symmetric matrix has the 
same latent vectors as the matrix, and that the latent roots of the 
inverse are the reciprocals of the corresponding roots of the matrix. 
C almost fulfills this condition with respect to A. Of course, C is 
singular, since it has a zero root. However, if the zero root were 
replaced by a root of unity, then C would become nonsingular and 
equal to A-!, The replacing of the zero root by unity is accom- 
plished by adding Vf,f;/N to C,; to form the matrix R defined by 


(63) Rus = Cus + VAS/N, (h,i = 0,1, ---, m) 
Then 
(64) R= A> 
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Thus, we have the interesting result that the difference equations 
involve essentially the inverse matrix of the matrix in the original 
stationary equations for scores. 

The inversion for weights. An exactly similar situation holds for 
the equations for weights. It is convenient to change notation 
slightly and to let 


(65) 2; = Fy; (j = 1,2, ---+, 2m) 


Then equations (48), (46), and (50) can be written in the respective 
expanded forms: 


(66) ai(1/fo + 1/fi) — aa/fr = ao 


(67). Se /f Pees + Wha d = n4e/hei= oe 
j= 1, 2, ceeym™m — 2) 


$ 


Cu — 


(68) 2m — Sn -it 2m(1/fn ~it+t 1/fm) = 7 zm 
Let Fj. (j,k = 1,2, --- ,m) be the elements of the symmetric 
matrix F defined as 


-1 1,1-1 


si Aca abiaal Perrrrre) ; 4 
ey a Ae nee 

—1 1 
fect, oui te 


Notice how F differs in form from C in (54) in its first and last ele- 
ments. Equations (66), (67), and (68) combine into the single 
matric equation 


r et = 
(70) pia 2 in =< Pi 7% (k = 1, 2, eee m) 


-1 
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Equation (70) is not directly comparable to (16) because the 
square matrices involved are of orders m and 2m respectively. To 


reduce (16) in terms of a matrix of order m, we first verify from 
(17), (3), and (9) that 


_Fi_ op j= 1,2, +++, 2m 
(71) Brim = Ba oo 
Also, from (24) and (65), 

(72) Ztm = —% (k = 1,2, ---,m) 


Hence, (16) becomes, from (65), (71), (72), (5), (11), and (34), 

(73) YO" e(NBa — FsPi/n)/ey—1 = nes (b= 1,2, +++, m) 
Let: 

(74) 2; = 2,/Vey—1 

(75) Ba = (NBu — FiFi/n)/Vey—sce—-1 (3, & = 1,2, «++, m) 

(76) Fu = 5p PaVesie—1 (9, = 1,2, ---, m) 

and let 

(77) Z=|lel, B= |\Ball,  F = (lFall 

Then (73) and (70) become, respectively, remembering (60), 

(78) 2B = nz 


(79) 2F = 


z 


3) 


It is easily seen that F is precisely the inverse of B. In contrast 
to C, B is nonsingular as it stands, for it has m positive latent roots 
and is of order m. Every solution of (16) which satisfies (24) must 
also satisfy (78). There is one such solution corresponding to each 
nontrivial latent root of A because of (22). Since A has a latent 
root for each of the m nonconstant latent vectors, all of which roots 
are positive, B has these same latent roots and is nonsingular. 
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Then, from (78) and (79), the m latent roots of F are the reciprocals 
of those of B, so that 


(80) F=f 


The latent roots are all distinct. It is of considerable interest to 
know whether or not the latent roots are distinct. As is well known, 
if any roots of a symmetric matrix are distinct, then the correspond- 
ing latent vectors are orthogonal to each other. We shall establish 
the distinctness by showing that any two latent vectors having 
equal roots must be the same vector, which will contradict the 
well-known theorem that any symmetric matrix has as many 
linearly independent latent vectors as its order, regardless of the 
nature of the roots. 

Suppose that u; and v; (t = 0, 1, --- , m) are vector solutions to 
(51), (52), and (53), with the respective roots ¢: and ¢;. Accord- 
ing to (51), 


(81) Co(uo — tx) = Piforo 
(82) Colt — 11) = drforo 


Multiply both members of (81) by u, and of (82) by uw, and sub- 
tract the resulting second equation from the first: 


(83) Co(Uotr — tate) = (h1 — h2)forloro 
Hence, if ¢: = ¢2, then the left member vanishes: 


(84) 


2) v1 


From (52), we have 


(85) uses + Us 4 ales + Ci 41) — Us tig = Oifig sys 

(i = 0,1, ---,m— 2) 
(86) vies + 06 4.i(Ce + Cr 41) — Oe ete gt = Oofig tigi 

(¢ = 0,1, ---,m — 2) 


Multiplying both members of (85) by v; +41, and of (86) by u; 41, 
and then subtracting, yields 


(87) celts 4105 — Us 41) H erg rls 4 E42 — Us 42% 41) 
= (di — da)feg es 4 Wig (@ = 0,1,---,m— 2) 
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If ¢; = ¢:, then the right member of (87) vanishes. Also, for 
t = 0, the first term on the left vanishes according to (84). Hence, 
the second term on the left must also vanish for + = 0, which im- 
plies 


UW Us 
(88) 


v1 vg 


From (88), the first term in (87) vanishes for 7 = 1, so that the 
second term must also vanish for 7 = 1. Continuing in this fash- 
ion, we see that, if ¢, = de, then 


Ui Ui+1 


(89) = 0, (@¢ = 0,1,-+-,m— 1) 


V5 Us+1 


That (89) holds for 7 = m could also have been seen from (53), 
according to which: 


(90) Cm —1(Um — Um —1) = Difmtim 
(91) Cm —1(0m — Um —1) = Prfmtm 


Multiplying both members of (90) by vm, and of (91) by um, and 
subtracting, yields 


(92) Cm —1(UmYm —1 — Um — 10m) = (fi — Pa) fattmdm 
Hence, if ¢1 = ¢:, the left member must vanish, or (89) holds for 
+=. 


It follows from (89) that the matrix 


Us UW oes Um 
1) v1, eee Um 


is of rank one, which is another way of saying that the u; and »v; are 
proportional to each other and define the same (normalized) latent 
vector. 

But this contradicts the fact that to two latent roots, equal or not, 
must correspond two linearly independent vectors. Hence, it must 
be that o: ~ 2, or all the latent roots are distinct. This includes 
the trivial as well as the nontrivial roots. 

Since the roots are all distinct, it follows that all the latent vectors 
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of A are orthogonal to each other, and similarly for the latent vec- 
tors of 8. According to (56), this implies that the score vectors are 
orthogonal with respect to the f;: 


(98) >. _ fms = 


According to (74), (65), and (34), the weight vectors are orthogonal 
with respect to the 9,F;/F; + m: 


(94) > UF Ui = 9 


j=l Fie 


where w, and y; are any two different solutions of (16). 

Sum of the nontrivial roots ts unity. It is well known that the sum 
of the latent roots of a matrix is equal to the sum of the elements in 
its principal diagonal. Hence to obtain the sum of the latent roots 
of A we need only to evaluate the sum of A;; (i = 0, 1,--- ,m). 
From (57) and (14), noticing that e*,; = ¢,;, 


ali VW 05 Gano... 

(95) Aunt YO St = 01m) 
Summing both members over 7, and using (9), we obtain 
(96) >». Au es L ” os 


Hence, from (5) and (6), 


(97) Dy da? 


The trivial root of unity is Included in the sum of all the latent 
roots implied by (97). Therefore, the sum of the nontrivial roots 
is unity. This means that the squares of all m correlation ratios 
add up to unity, or, the total variance, taken as unity, has been 
divided into m additive parts. 

The alternating signs of bending points. The difference equations 
provide an easier means of studying the nature of the principal com- 
ponents than do the matric equations. Through them, we can now 
show the oscillatory nature of the latent vectors, both for scores and 
for weights. We begin with the scores, and shall show first how 
each bending point in the curve of a principal component, like those 
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portrayed in Figure 1, must be of the opposite sign from its prede- 
cessor. 

Suppose that x, is a convex bending point, defined by the condi- 
tions 


(98) Ze 2 le~1, Le > Ta41 
This implies 

(99) Awe-120, Agta <0 
so that 

(100) Coots — Ca—1Agta_-1 < 0 


Hence, from (100) and (88), setting i + 1 =a, 
(101) Za > 0 


In exactly an analogous manner, if x, is a concave bending point de- 
fined by 


(102) Xp SX_1, << %41 
it follows that 
(103) a <0 


Since bending points must be alternately concave and convex, it 
follows that successive bending points alternate in sign. 

In particular, (101) and (103) show that no bending point can 
vanish. 

We shall further show that the first internal bending point must 
be opposite in sign to the first point, 2, and that the last internal 
bending point must be opposite in sign to the last point, tn. 

First, we notice that 2) can never vanish, for then 2, would vanish 
according to (51), and then the remaining 2; would all vanish ac- 
cording to (52). Since latent vectors are determined only up to a 
constant of proportionality, there is no loss of generality if we set 


(104) my =1 
Then (51) becomes 
(105) co(l — 11) = Ofo 
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Since the right member of (105) is positive, it must be that 


(106) a1<l 


This means that, if the metric of (104) is used for all the principal 
components, each of their curves starts out with a downward slope, 
so that the first bending point (if there is any) must be concave. 
But all concave bending points are negative, according to (103); 
and 2 is positive, according to (104). Therefore, the first bending 
point must be opposite in sign from 2p. 

In an analogous manner, we find that z,, can never vanish, and it 
must be opposite in sign to the last turning point. 

Therefore, for any nonconstant principal component, if the z; are 
plotted as a function of 7 as in Figure 1, and if the points are con- 
nected by a polygon, then the polygon must cross the 1-axis between 
each two successive turning points, counting the first and last points 
also as turning points. 

Proof of the number of oscillations. Let us consider again two solu- 
tions, u,; and v,;, with their respective—and distinct—roots ¢; and ¢». 
Let us suppose that ¢, is the larger of the two roots, and let 5 be the 
difference, so that 


(107) 5=h-—G>0 


We want to prove that the curve of the u,, plotted as a function of 7, 
bends at least once more than does that of the v;, That is, we want 
to show that if one set of component scores has a smaller correlation 
ratio than the other (and hence a larger ¢), then it also has at least 
one more oscillation in its curvilinear relation to 1. First we shall 
show that the u,; and v; cannot change in sign equally often, and 
hence cannot have the same number of turning points. Then it 
will be seen that?” the u; bend more often than do the »;. 
Let t; be defined by 


(108) i= Cs(U05 + 1 Uy 3), G = 0, 1, soem — 1) 
Then (83), (87) and (92) can be written, respectively, as 


(109) ty = Sfouovo 


17In this paragraph we tacitly assume that neither u nor v is the constant vector, 
so that their latent roots correspond to actual correlation ratios. But the constant 
solution has no bends, and neither does the metric vector, so these two solutions “‘oscil- 
late” equally often. However, the constant solution has no sign changes, whereas the 
metric vector has one sign change. The proof that follows includes the constant vector. 


854 COMPONENTS OF SCALE ANALYSIS 
(110) bai te = Ofc 4 eg 41 (¢ = 0,1, ---,m — 2) 


(111) —tn 1 = Ofmitntm 


Suppose that the u,; and v; change sign equally often. Then with- 
out loss of generality, since endpoints cannot vanish according to 
the preceding section, we can set: 


(112) UM > 0 
(113) Unt > O 
Let a be the first value of 7 for which uv; < 0, so that 
(114) um20, @=1,2,--+,4a—1) 
(115) Uva < 0 


That there must be an a < m satisfying (114) and (115) follows 
from the fact that the u; and »,; are orthogonal as in (93). 

From (109), (112) and (110) it follows that ¢,-, > 0, whence, 
remembering (108), 


(116) Ue — We > Uda —1 


Dividing both members of (116) by u.v., remembering (115), shows 
that 


Ua —1 Va -1 
(117) a < a 


Therefore, if v, and v, — 1 are opposite in sign, then also u, and u, —1 
are opposite in sign; but this contradicts (115) and (114) when 
1=a-—1. Hence, according to (115) and (114), it must be that 


(118) tte —~1la SO , Va-1ta 2 0 


At most, only one of the equalities in (118) can hold, for according 
to (117), u.—1 and »,—; cannot vanish simultaneously. If v1 
= 0, then there is a change in sign between u._1 and uw. If 
U,—1 = 0, then v,—, and », have the same sign; but u.—: and u, 
have opposite signs as is easily seen from (85) when? +1 =a — 1; 
hence also », — : and », must have the same sign to conform to (115) 
and to (114) when i = a — 2. If neither u,—, nor v, — 1 vanishes, 
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then (118) shows a change in sign between u,—.1 and u, but no 
change between v, — ; and 2,. 

In any event, then, whether or not u,—1%—1 = 0, the u,; have 
one more sign change than the »; in the interval 0 $7 Ss a. 

Let 6 be the first value of 7 after a for which uw; > 0, so that 


(119) uw, S 0, G@=a+1,4+2,---,b —1) 
(120) uw, > 0 


Two possibilities need be considered. Either é,., < Ooré,-1 2 0. 
Suppose that t,.1< 0. Then from (108) 


(121) Uy — 109 < Uys —1 (t ~1 < 0) 


Dividing both members of (121) by u,v, shows that 


(122) Mare M=1 (1 <0) 
Up Vs 


Reasoning as with (117), but using (119) and (120) instead of (114) 
and (115), shows that the wu; must have one more sign change than 
the v; in the intervala +1 S57 6. 

If t_1 2 0, then instead of (122) we have 


(123) Mats %=1 %, 120) 


Up” = 
whence the », can possibly have one more change than the wu; in the 
interval a + 1 37% Sb, so that the u; and »,; could have the same 
number of sign changes in the interval 0 $7 <b. In this case, it 
must be that b < m in order that (111) and (113) be satisfied. 
Then we can go on toc, the first value of 7 after b for which uv; < 0: 


(124) uw, 2 0, @=b+1,b4+2,---,¢-1) 
(125) Uve <0 


That there must be a value of ¢ > 6 for which uv; < 0 follows from 
the fact that ¢; > 0 for all 7 > b such that uv, 20 (h = 6 +1, 
+++, 1%), according to (110), (120) and the hypothesis that t, 1 2 0, 
yet (111) and (113) must hold. In particular, 


(126) t-1> 0, (ts -1 2 0) 
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Hence, from (126), (108), and (125), 


(127) ra < ee (t 1 2 0) 


Equations (124), (125), and (127) are precisely of the same form as 
(114), (115), and (117) respectively. Hence, the u; must have one 
more change of sign than the »; in the intervalb +1 SiS. 

Hence, whether t, _1 < 0 oré,—, 2 0, the uv; must change in sign 
more often than the »; in the interval 0 $i Sc. Continuing in 
this fashion, it follows that the u; change in sign more often than 
the »; in the interval 0 < 7 < m, which contradicts the hypothesis 
that the wu; and v; have the same number of sign changes. There- 
fore, no two different latent vectors can have the same number of sign 
changes. 

There being but m + 1 elements in a latent vector, no such vector 
can have more than m changes in sign. But there are m + 1 latent 
vectors, no two of which can have the same number of sign changes. 
Hence, there must be a vector with no sign changes, a vector with 
one sign change, a vector with two sign changes, etc., and a vector 
with m sign changes. Furthermore, a review of the proof shows 
that a vector with a smaller correlation ratio than another has the 
larger number of sign changes. 

The vector with no sign changes is the constant vector. The 
vector with one sign change also has no bends, and is the metric 
vector for the scale, being a monotonic function of the rank order. 
The vector with two sign changes must have one bend, and is identi- 
fied as the intensity function. The vector with three sign changes 
must have two bends; that with four sign changes must have three 
bends; etc. The vectors with two or more bends are yet to be 
identified psychologically. 

The oscillations for weights. The weight components oscillate in 
a manner similar to the score components. The difference equation 
(46) is the same in form as (38), but the boundary conditions vary 
slightly, so the nature of the oscillations is not quite the same. 
Also, we have to consider all the 2m elements in a weight component, 
instead of the m + 1 elements in a score component. 

How the weights oscillate can be seen from equation (30). The 
vector A,z; (¢ = 0, 1, --- ,m — 1) must have one less change in 
sign than the vector z; (¢ = 0, 1,---,m). But from (30), the 


18 See footnote 2 above added in proof. 
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weight vector y;41: (¢ = 0, 1,-++,m — 1) changes sign whenever 
A,x; does. Therefore, the first m elements of the weight component 
y; (j = 1,2, +--+ , 2m) change in sign once less than do the elements 
of the latent score vector belonging to the same correlation ratio. 
The last m elements change in sign similarly, by virtue of (24). 

Particular solutions: the classical orthogonal functions. Difference 
equation (38), with its boundary conditions (41) and (44), brings 
to mind the classical Sturm-Liouville boundary value problem, 
which has as particular solutions the Legendre polynomials, Hermite 
polynomials, Laguerre polynomials, etc. These classical functions 
are orthogonal, and oscillate much as do the solutions to (38). 
Actually, these classical orthogonal functions can be seen to be ob- 
tained in the limit from solutions to (38) as m becomes indefinitely 
large. 

The Sturm-Liouville problem” concerns the solutions y(z) to the 
differential equation 


2 | ete) #4] + anern) =0 


over an interval a < x S$ b, where c(z) and f(z) are given non- 
negative functions and ¢ is a constant, with boundary conditions 


e(a)y'(a) + $f(a)y(a) = 0 
c(b)y’(b) + of(b)y(b) = 0 


It is clear that our difference equation problem is a discrete analogue 
to the Sturm-Liouville problem, which concerns continuous func- 
tions, so that it should not be surprising that the limits of our solu- 
tions should be solutions to the continuous case. 

The general discrete case does not seem to have been noticed be- 
fore, although special cases have been studied. Tchebychef® stud- 
ied a particular discrete solution which in the limit becomes the 
Legendre polynomials. The Krawtchouk polynomials*! are a solu- 


WE. L. Ince, Ordinary Differential Equations (Longmans, Green and Co., Ltd., Lon- 
don, 1927), p. 223-253. 

» “Sur une nouvelle série,” Oeuvres, 1, pp. 381-384; “Sur l’interpolation,” ibid., 1, pp. 
541-560; “Sur l’interpolation par la méthode des moindres carrés,” tbid., 1, pp. 473-498; 
“Sur l'interpolation des valeurs équidistants,” ibid., 11, pp. 219-242; “Sur une formule 
d’analyse,”’ ibid., 1, pp. 701-702. 

For the approach to the Legendre polynomials, see G. Szegé, Orthogonal Polynomials 
(Colloquium Publications, Vol. 23, American Mathematical Society, New York, 1939). 

"See M. J. Gottlieb, “Concerning Some Polynomials Orthogonal on a Finite or 
Enumerable Set of Points,” American Journal of Mathematics, Vol. 60, No. 2 (April 
1938), pp. 453-458. 
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tion to (38) which in the limit yield the Hermite polynomials. 
Charlier developed a series of polynomials which can be seen to 
satisfy (38), and in the limit yield the Poisson-Charlier poly- 
nomials.* Gottlieb* developed a series of polynomials which 
also can be seen to satisfy (38), and which in the limit yield the 
Laguerre polynomials. 

None of these treatments of particular solutions seems to have 
recognized the general case expressed by equation (38). Perhaps 
the reason for this is the fact that the second term in (38) involves 
4+ 1 rather than 7. For example, Tchebychef’s case is that for 
which (in our notation) f; = 1, g; = 1, so that 


c¢=@+1I)(N-t-1), @=0,1,---,n) 


and N =m-+1. Expanding the first term of (38) with these 
values of c; we can rewrite (38) as 


(i + 2) (N —¢ — 2) Aga; + (N — 21 — 3) Aw, + Ori41=0 


Tchebychef, however, expressed the difference equation in the 
form™ 


(i + 2) (N — i — 2) Ava, + (N — 21-34 ) Acai + Gx, = 0 


Using this form as a starting point, it is not too obvious that a 
generalization like (38) is to be obtained; the temptation is strong 
to keep x; in the last term on the left and not to use x; 4,1. It seems 
fortunate that the problem of scale analysis led to the general equa- 
tion (38) in a natural fashion. 

The numerical example used in the first half of this chapter is 
Tchebychef’s solution for m = 5 (N = 6), since Tchebychef’s case, 
in our setup, is that of a rectangular distribution for types of people 
and for types of items. Nonconstant values for the f; and/or the 
g; yield different solutions, including the various sets of orthogonal 
polynomials previously referred to. 

In general, however, the orthogonal functions obtained from non- 
constant frequencies need not be polynomials. For example, a par- 
ticularly simple solution is for a case which is well known in the 


3 Ibid. 
3 Ibid. 
™“ “Sur l’interpolation,” op. cit., p. 557. 
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theory of difference equations. If f;=1 and c;=1 (that is, 
Os41 = Fy 4i.F i441), then (88) and its boundary conditions 
become: 


Aga, + O24 41 = 0, (i = 0,1, ---,m — 2) 
Acts + x = 0 
Antn -1 —- bin = 0 


These equations traditionally have been written in expanded form, 
rather than in the above form, again perhaps because of lack of 
recognition of a possibility of a generalization like (38). The solu- 
tion consists simply of cosines.* 

Reproducibility from the components. Just as the principal com- 
ponent scores and weights are derived from the item responses, just 
so the responses can be reproduced from the components. This 
reproducibility, in fact, holds whether or not the e,; form a scale 
pattern, as we shall see from (20). 

In order to discuss all the components simultaneously, it is neces- 
sary to modify previous notation. Let mo = 1, and let m, m, --- , 
nm be the respective m correlation ratios for the m principal com- 
ponent vectors satisfying (13). (If desired, it can be assumed that 
m>™ > --- > mm so that the latent vectors have the order: 
constant vector, metric vector, intensity vector, etc.) Let 2a; be 
the ith element in the vector corresponding to 7. (a,7 = 0,1, --+,m). 
Similarly, let y.; be the jth element in the latent vector satisfying 


(16) with n, = 7. (a = 0, 1,---,m; 7 =1, 2,---,2m). Then 
(20) can be written as 

ns = Pp. a=0,1,°:-,m 
28) YP ahi = maya §— (FID) 


According to (93), if a = b, then the 2,; and 2;; are orthogonal 
with respect to the f;. Since the vectors are determined only up to 
a constant of proportionality, we can normalize them so that 


(129) Dd fet=1 @=0,1, +++, m) 
Then, from (98) and (129), 


% Theodore VY. Karman and Maurice A. Biot, Mathematical Methods in Engineering 
(McGraw-Hill Book Company, Inc., New York, 1940), pp. 453ff. 
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(180) Dy. feta = Ban (@, = 0,1, ++, m) 
where 8, is Kronecker’s delta. Equations (130) imply that the 


matrix || 2.:Vf; || is orthogonal, so that also its transpose is orthog- 
onal: 


(131) Die Bata Iahi = 61; (h,t = 0,1, +++, m) 


Multiply both members of (128) by 2esVf, and sum over a. 
Using (131), and rewriting subscripts, we see that 


(132) ei; = >. ‘ reas 
where 

= : a=0,1,-: *,>m 
(133) Wai = Mal” Yoj G = 1,2,+--, ) 


Equations (132) show how the response of the ith type of person 
to the jth type of item is a linear function of his principal component 
scores. Each type of category has its own set of weights w,;, and 
each type of person has his own set of scores. The responses to the 
items can be reproduced algebraically from the weights and scores 
according to (132), just as well as they can be from the scale pattern 
defined in (2). According to (2), the item responses are a simple 
function of the rank order of people; according to (132), the item 
responses are a linear function of m principal components. Here is 
a paradox, that in one case reproducibility is from but one variable 
whereas in the other case reproducibility is from m variables. The 
paradox is resolved by remembering that the principal components 
in turn are perfect functions of the rank order. 

A resolution into components like (132) is obtained whether or 
not é,; is a scale pattern. The difference between the scalable and 
nonscalable cases is that, in the latter, the principal components are 
not in general perfect functions of a rank order. 

External prediction. An important property of a scale is the ease 
with which the predictive power of its items can be used for any 
arbitrary outside variable. Just as the scale ranks can perfectly 
reproduce each item in the scale, just so are the scale ranks perfectly 
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efficient for any external purpose. It is easily seen that the condi- 
tional distribution of any variable on the items in a scale 1s precisely 
the same as its conditional distribution with the scale ranks. Indeed, 
this theorem borders on being a tautology. A conditional distri- 
bution with the items themselves is a set of conditional probabilities, 
one for each combination of item responses. But each combination 
of item responses has but one scale rank. Conversely, each scale 
rank corresponds to but one combination of responses. Hence, the 
set of conditional probabilities for scale ranks is identical with that 
for item responses. This means that the multiple regression of any 
outside variable on the items is equivalent to its simple regression 
on the scale ranks. The same predictions and the same errors of 
predictions are obtained either way. 

The resolution into components expressed by (132) adds an inter- 
esting view to the problem of external prediction. In the case of 
linear regression of an outside variable on the items, the prediction 
for the ith type of person is of the form 


2m 
(134) p= Di des = 0,1, +++ m) 


where b; are regression weights. According to (132), then, 


(185) n= Ds Fatwa! (@ = 0,1, +--+, m) 
where 
(136) Wwe! = » bavey (9 = 0, 1, +++, m) 


Since the z,; are orthogonal (with respect to the f;), each w,’ is pro- 
portional to the zero-order correlation of the outside variable with 
Zax. If p;is a linear function of 2;, then wa’ = Ofora = 2,3, ---,m. 
That is, of the regression is a linear function of the metric component, 
then the regression weights for the intensity and further components are 
all zero. The intensity and further components get nonzero weights 
only when the regression on the metric component is curvilinear. 


CHAPTER 10 


THE LOGICAL AND MATHEMATICAL 
FOUNDATION OF LATENT STRUCTURE 
ANAL YSIS* 


Introduction 


HAPTERS 10 and 11 describe the latent structure approach to 
C the treatment of itemized tests. Each chapter is divided into 
three sections. The first section of Chapter 10 develops the general 
theory of ‘“‘trace lines” and introduces their ‘equivalent latent struc- 
tures.”” This section is written primarily for the nonmathematician. 
In the second section the general algebraic relationships between the 
latent structure and manifest (empirical) test data are derived. The 
third section shows in a number of specific cases how the latent 
parameters can be obtained from manifest data if certain conditions 
of reducibility are satisfied. 

In Chapter 11 numerical examples are introduced using Research 
Branch data. Section IV deals first with a latent dichotomy and 
discusses in detail problems of testing empirical data against a 
presumed latent structure. Next the techniques are applied to a 
special case of a latent trichotomy. The latter is shown to be 
closely related to a Thurstone attitude scale. In the second sec- 
tion data are investigated which correspond to a “‘social distance”’ 
test and to Guttman’s scalogram pattern. There also an example 
is given which permits discussion of the empirical handling of sev- 
eral latent characteristics. In the third section the relation of the 
latent structure theory to the general problem of scaling is further 


! By Paul F. Lazarsfeld. The investigations presented in this and the following chap- 
ter were developed in connection with work the writer did as a consultant to the Research 
Branch. Some of the general ideas were discussed between the writer and Professor 
8. A. Stouffer in connection with earlier joint research efforts. One cannot overestimate 
the influence which these many years of intellectual cooperation have had on the present 
text. No further effort will be made to trace in detail all the help and advice Dr. 
Stouffer has given during the writing of Chapters 10 and 11. 
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discussed. This provides an occasion to consider a number of other 
measurement problems as they have appeared in the literature on 
itemized tests. 

The six sections in Chapters 10 and 11 are numbered consecutively. 
An equation numbered 4.7 is therefore equation 7 in section 4. 


SECTION I 


GENERAL BACKGROUND 
l]. THE NOTION OF TRACE LINES 


The practice of testing can be roughly described as follows: The 
investigator assumes that a one-dimensional continuum exists, such 
as soldier morale, anti-Semitism or intelligence. People are as- 
sumed to be arranged on this continuum in an unknown way. 
Then, persons are exposed to a series of test items. They can an- 
swer questions with yes-no, true-false, agree-disagree, etc.; or they 
can be observed as to whether they do or don’t do certain things; 
or they can be given arithmetic problems to see whether they can 
or cannot solve them. Any verbal or other behavior which permits 
a dichotomous classification of people will hereafter be called a test 
item. 

The investigator chooses items which seem to him relevant to the 
problem. In a test of soldier morale he will not usually include a 
problem of integral calculus, and in an intelligence test he will not 
include a question as to whether the respondents like the food they 
are getting in the Army. (The reader will remember that we are 
giving a description of prevailing practice and not discussing how 
tests should be or would be constructed.) 

After the test items are selected, they are given to a sample of 
people. The purpose of giving the test is obviously the following 
one: From the distribution of ‘‘answers” to the items we can, it is 
hoped, make inferences as to the position of the people on the as- 
sumed continuum. While we might not be able to say metrically 
where each respondent belongs on the continuum, we want at least 
to rank people according to their position there. The scoring sys- 
tem developed for a test always implies certain assumptions, how- 
ever vague, about the relations between performance on the items 
and the hypothetical continuum. Because the “‘response patterns” 
of the people are obtained from actual experiments, we shall call 
them manifest data; all the information inferred as to the nature of 
the continuum or the position of people thereon we shall call latent. 
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Nothing more is implied in this terminology than the distinction 
between information obtained by direct observation and information 
obtained by using additional assumptions and/or by making infer- 
ences from the original data. 

We shall now translate the foregoing description into more pre- 
cise terms. The continuum we shall call x and represent it on the 
z-axis of a coordinate system. Nothing will be presumed about this 
continuum for the time being other than its existence, and even that 
will be made subject to empirical inquiry later. 

Let us look at specific test items. Suppose in connection with 
studying the effect of an indoctrination film we want to test the 
“ethnocentrism” of American soldiers. By this we mean vaguely 
the extent to which they think that only Americans are of value 
and that the actions and ideals of the war allies are of little value or 
importance for the war effort. Suppose the following three items 
have been proposed for inclusion in the test: 


1. I believe that our European allies are much superior to us in strategy 
and fighting morale. Yes No 


2. The majority of all equipment used by all the allies comes from 
American lend lease shipment. True False 


3. Neither we nor our allies could win the war if we didn’t have each 


other’s help. Agree Disagree 


All three items were included in the test because they are related 
and partially express ethnocentrism the way we have conceived it 
to begin with. This fact can be formulated more precisely in the 
following way: We assume that the probability y, that respondents 
check the first alternative in the 7th question, is a function f(x) of 
their position on the continuum 7, their degree of ethnocentrism. 
This formulation can be graphically represented in Figure 1. The 
graphical picture of our functions f;(7) we shall call the trace line of 
item 7. 

The way these trace lines have been drawn can be expressed ver- 
bally as follows: According to Figure 1, item 1 is likely to be con- 
sidered true only by people with very low ethnocentrism. Among 
persons with at least some ethnocentrism, hardly anyone will con- 
sider the statement true. Item 2 is more likely to be answered 
“tyes” the more ethnocentric a respondent is. There is no special 
break in trace line 2: the probability y goes uniformly up with z, 
and the trace line can be expressed by the equation y = a + bz. 
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The third item is the typical ‘middle item” of any attitude test. 
Its trace line goes down toward the two extremes. The probability 
that the statement will be agreed with is highest in the middle range 
of the continuum z. Trace lines of this type obviously can vary 
greatly in the steepness of their peak. 

In this discussion we shall deal only with dichotomous questions. 
(If questions permit three or more qualitative answers, the data 
can be reduced to a set of dichotomies, as will be shown at the end 
of Chapter 11.) No limitation is set on the kind of dichotomies 
which can be used. It would not change the theory, for instance, if 


Y 


probability of 

answering the 

first alternative 
of a series of 

questions. 


- 00 <— X ethnocentrism oe 
Figure 1. Trace lines of 3 items in a test of ‘“ethnocentriam.” 


some of the items were observations to the effect that each person 
did or did not perform a certain act or own a certain object. It also 
is a matter of indifference which of the two answers is plotted in a 
trace line as long as the same practice is maintained all through the 
analysis. The answer which is used to formulate the trace line will 
be called the “positive answer.” In this sense it can then very well 
be that we decide in a specific context to use the answer “ho” as the 
“positive” alternative. The term ‘positive answer” will refer here- 
after to the answer which is used to compute probabilities. 

It is, of course, not assumed at this point that we actually know 
the trace lines for items. The contention is only that this whole 
presentation is an acceptable formalization of the reasoning which 
goes on when an item is included in atest. In practice, most of the 
time, we merely have a vague feeling that the trace line goes up or 
down from left to right or has its peak in the middle. (It will be 
shown later, however, that it is possible to make a good guess in 
advance as to whether the trace line “breaks” or whether it has a 
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linear shape.) Moreover, we have at this point discussed only the 
geometric shape of the trace lines and not their position up and 
down on the y-axis. 

The gist of the discussion up to this point is as follows: 


a. Items are included in tests because they are supposed to have some kind 
of relationship to the underlying continuum which is to be measured. 


b. The reasoning of the test builder consists of a series of expectations. 
He expects people to be distributed over the underlying continuum; 
at each level of the continuum, he thinks, people will have a different 
probability of answering a test item “‘positively.”’ 


c. If the test builder could express his reasoning with perfect clarity and 
in full detail he would set down for each item a trace line. This would 
represent a mathematical function relating to each level of the con- 
tinuum a definite probability for a positive answer. 


d. In a concrete case, the actual reasoning of the test builder who chooses 
an item can usually be translated into his assuming some more or less 
precise properties of a hypothetical trace line. 


2. AGGREGATES OF ITEMS WHICH FORM A “PURE” TEST 


Although there is, of course, no way to verify such a set of as- 
sumptions directly, they represent what we think is a plausible 
description of the process of test building. But the description is 
made in mathematical terms. It leads to a model from which, as 
will be seen later, rather important mathematical inferences can be 
drawn. ‘The inferences can be judged as to whether they are right 
or wrong. The original “formalization” can be judged only as to 
whether it is reasonable and useful. 

The test builder not only makes assumptions about each separate 
item; he also judges the items in their interrelationships. This 
judginent enters in a variety of ways. Suppose a large number of 
items has been proposed for inclusion in a test of ethnocentrism. 
Someone is sure to ask how we know that this is a “unitary atti- 
tude.” Maybe American soldiers are convinced of American tech- 
nical superiority but believe in British cultural superiority. Should 
we have just one test or two tests? Or, suppose that among our 
questions there are a few which express practically the same idea 
with only minor changes in wording. Wouldn’t a respondent who 
approves of such a particular idea get a much higher score than an- 
other respondent who prefers to express his ethnocentrism through 
another idea? One may also inquire how we know that an aggre- 
gate of items is a “pure” test of ethnocentrism; might it not meas- 
ure ethnocentrism and something else? 
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All such considerations can, as will be seen, be reduced to an 
analysis of the interrelations between the items. Actually certain 
expectations as to these interrelationships are already implied in the 
preceding discussion. Let us take items 1 and 2 in our test of ethno- 
centrism. By and large, the items were picked because we assumed 
that soldiers high in ethnocentrism would think that most allied 
equipment comes from America and that European armies are not 
superior to American ones. (This expectation is expressed in the 
trace lines of items 1 and 2 in Figure 1.) Therefore, the answers to 
the two items will show a correlation—in this case a negative rela- 
tionship. Respondents giving a positive reply to item 1 will be 
more likely to give a negative answer to item 2. We shall now call 
a pure test of a continuum z an aggregate of items which has the 
following properties: All interrelationships between the ttems should 
be accounted for by the way in which each item alone is related to the 
latent continuum. 

Before giving this idea a mathematical formulation, some addi- 
tional remarks are necessary. No requirement has been made that 
we should in practice always use pure tests. The notion is intro- 
duced only to make precise a number of distinctions which come up 
in practical work. Suppose ethnocentrism divides itself into two 
dimensions—cultural and technical. We should then be able to 
separate those items which are pure measures of the one and those 
which are pure measures of the other dimension; they would be 
those items which have flat horizontal trace lines with the one dimen- 
sion they do not measure. Some of the items might well measure 
both dimensions. That is to say, their trace lines on both continua 
might be nonhorizontal. Nothing at this point permits us to decide 
whether it is a good idea to combine the three types of items in one 
test. The only purpose of the discussion is to find a way by which 
we can recognize what kind of aggregate of items we are treating. 
In a similar way, we would want our analysis to clarify what is 
meant by group factors in itemized tests, what is meant by biased 
questions, and so on. It will turn out that all of that can be 
achieved by comparing an actual aggregate of items with the special 
case of a pure test. 

The crucial point is to obtain a precise formulation for the idea 
that the interrelationship between items should be completely ex- 
plained by the existence of one underlying continuum. Scrutiny of 
research practice suggests that this be formulated as a negative 
statement. If a group of people has the same z-value—i.e., the 
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same degree of ethnocentricity—then, with ethnocentricity held 
constant, nothing else “holds the two questions together.”’ Thus, 
for all people with any specific z-value, z., we have a fourfold table 
like the following: 


Item 1 
Do not 
Endorse endorse 
Endorse i) 
Item 2 
Do not 1 
endorse er 
Pi l1-p 1 


In a pure test, there is no association between responses to the two 
items; therefore, the entry within the upper left-hand box, which 
we call pu, is equal to pi X pe. In terms of trace lines, for any 
x = x., the probability of a joint positive answer fi:(x) is the prod- 
uct of the independent probabilities for the two items under dis- 
cussion : 


Jis(Ze) = fal%e) fa(te) 
But this should of course be true for every point of the continuum. 


So we come to the following definition of a pure test: It is an aggre- 
gate of items such that the trace line for a joint positive answer to 


several items 1, 7, k, --- is the product of the trace lines of every 
item taken separately. 
(1.1) Size... (2) = Fil) f(z) filz)- +> 


In actual experience, of course, it can easily happen that equation 
(1.1) is not fulfilled. There might be a little detail in the wording 
of two questions which “‘ties them together’ beyond what can be 
accounted for by the underlying continuum. Thus for people who 
have the same 2z-value, we would find that f::(z) is greater or less 
than fi(z) f2(x). In this case, we usually say that there has entered 
an additional factor, that we do not have a pure measurement of the 
continuum xz. One should be sure not to think of equation (1.1) as 
describing manifest data obtained from actual test performances. 
If in the whole sample p:: per cent gives a positive answer to items 


LATENT STRUCTURE ANALYSIS 869 


1 and 2, this value of pi: in general will not equal p: X pe. Thus is 
raised the decisive question of the relationship between trace lines 
and manifest response patterns in a concrete test situation. 

But first it will be helpful to summarize the second step of the 
discussion which was just developed. This would be a sequence 
continuing the summary given at the end of paragraph 1. 


e. It is possible to formulate mathematically what is meant if we say that 
an underlying continuum accounts for the interrelationship of two test 
items. 


f. Such a formulation reduces to this idea: If people have the same posi- 
tion on the underlying z-continuum, then their answers to the two 
questions will be unrelated; the probability that they will answer two 
questions positively is then the product of the probabilities that they 
will answer each question alone positively. 


g- This leads to the definition of a pure test: It is an aggregate of items 
such that the joint positive answers to any number of items have them- 
selves a trace line which is the product of the original trace lines for the 
items viewed separately. 


3. THE ACCOUNTING EQUATIONS 


In our original discussion it was already implied that each re- 
spondent has an (unknown) degree of ethnocentricity (x). The 
total sample is therefore characterized by a distribution function 
¢(x) which gives for each small interval dr the number of people 
¢(x)dx whose score lies in this interval. We now can tell what 
proportion of respondents in the whole sample will give a positive 
reply to item 7 with trace line f;(x): 


(1.2a) f G(x) filx)dz = pi 


The latter equation deserves a brief discussion for the nonmathe- 
matician. The symbol p; stands for the proportion of people in the 
whole sample who check the first answer in each of our three items. 
This is usually called the marginal of the item. To indicate that p,; 
comes from actual observation, we shall sometimes call it the mani- 
fest marginal. In a small score range dz we can assume that the 
probability of a positive answer is y = f(x) for all people. There- 
fore, within this range, the proportion of positive responses will be 
(x) f(z)dz. For the whole interval, it will therefore be given by 
a definite integration over the whole continuum. This leads to 
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formula (1.2a) which, in essence, expresses the fact that the num- 
ber of people who give a positive answer in the whole sample can be 
represented as a sum of all positive respondents taken from all 
strata which have a simi'ar degree of ethnocentricity. 

Equation (1.2a) contains the whole answer to our problem. The 
manifest response patterns in a pure test could obviously be deduced 
from the trace lines if the population distribution were known. For 
we also know the trace lines for joint occurrences. So by reasoning 
identical with that just carried out, we get the following set of equa- 
tions: 


(1.2b) i. (2) felz) f5(2) faz) ++ = Deze: 


It is important to be clear as to the distinction between a trace 
line y = f(x) and the population distribution y = ¢(z). In both 
equations xz represents the same continuum. But the meaning of 
y is different in the two equations. In the population distribution 
y represents the proportion of all people out of the total sample who 
have a certain degree of ethnocentricity. There is of course only 
one population distribution for the whole sample. But there are as 
many trace lines as there are items in the test. And for each trace 
line the value of y indicates the proportion of people with a given 
degree of ethnocentricity who make a positive response to an item. 

Equations (1.2) tell how the manifest response frequencies, as 
they come from an empirical observation, are related to the hy- 
pothetical trace lines and ¢(z). The frequencies on the right hand 
of the equations are known. The functions on the left side of the 
equations are unknown. The purpose of further discussion will be 
to decide what about f(x) and ¢(z) can be learned from these equa- 
tions. 

The term accounting equations is profitably used for (1.2). They 
express the postulate that the joint occurrences and therefore the 
interrelationships between the test items are accounted for by the 
existence of one underlying continuum. Inversely, they also de- 
scribe the way in which the test ‘‘measures”’ this continuum. We 
can now add the following points to our summary : 


h. A function $(x) is introduced which describes the distribution of a 
sample of people over the z-continuum, i.e., the underlying charac- 
teristic which the test is devised to study. 
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i. It is possible to set up a series of accounting equations which show how 
the manifest response patterns on the one hand are related to the latent 
trace lines and the population distribution on the other. (Equations 
1,2). 


j. The problem is how one can, with the help of these accounting equa- 
tions, learn the nature of the trace lines and the population distribution 
after an empirical test has provided the frequencies of all possible re- 
sponse patterns which are made by a sample of people. 


4, THE USE OF POLYNOMIAL TRACE LINES 


In their most general form, not much can be done with equations 
(1.2). As usual in such cases, we have to make certain simplifying 
assumptions if we want to carry out actual computation. A variety 
of such assumptions can be thought of. For the purpose on hand, 
it is most useful to assume that the trace lines are polynomials: 


f(z) =y=atbaeteet+-:--- 


The use of polynomials has a number of advantages. It is well 
known that a very large variety of shapes can be approximated by 
polynomials of rather low degree. Even if the trace lines are rather 
complicated functions, they can most likely be developed into a 
Taylor’s series and thus approximated by a polynomial. Further- 
more, the accounting equations have some simple and useful prop- 
erties if the trace lines are polynomials.? Consider, for instance, 
equation (1.2a). Assume for the sake of the example that the trace 
lines are all of second degree; then the manifest marginal of an item 
is related to its trace line in the following way: 


(1.3a) pPi=atbM™ + ¢M® 


In other words, the marginal is a linear aggregate of the moments 
M© of the distribution function. A similar result would be true 
for the joint occurrences. The relative frequency, for instance, with 
which respondents give a positive reply to two items would be given 
by the following formula: 


(1.3b) pis = aa; + (aid; + abi)M™ + (asc; + ajc; + 0))M 
+ (b,c; + bc) M® + ce;M© 


2 Professor L. J. Savage has pointed out to this writer that a more general assumption 
would be that all the trace lines are linearly dependent upon m of them. In such a more 
general theory the number m of independent trace lines would play the same role as the 
degree m of polynomial trace lines plays in the present text. For the purpose on hand, 
polynomial trace lines are sufficient to describe the relevant relationships between the 
items of a test and an assumed underlying continuum. 
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By combining the last two formulae, we find that the cross prod- 
uct, [47] = p.; — psp;, between the two items takes on an especially 
simple form: 


(1.80) [ij] = bb; [M® — M] + csc; (M® — M™) 
+ (bic; + b:) (M® — MOM) 


By thinking in terms of polynomial trace lines, our task becomes 
narrowed. We have to solve equations of the type (1.3): on the 
left side there are relative frequencies known from empirical obser- 
vation ; on the right side, there are forms built from moments of the 
distribution function and from coefficients of the trace lines. 

The concept of polynomial trace lines permits us to raise and an- 
swer an important question, namely: how should we decide on the 
degree of the polynomial which a given material requires? Should 
the trace lines be straight lines? Should they be quadratic para- 
bolas as every high school student knows them from his study of 
conics? Should they be cubic parabolas or of even higher degree? 
It turns out that the appropriate degree for the hypothetical trace lines 
can be derived from the manifest material. 

The general idea can be explained by the following example: Let 
us focus our attention on the joint occurrences of all pairs of two 
items which can be formed from a test. These joint occurrences 
can be set down in the form of a matrix. In the first row and in the 
first column of this matrix we put down the marginals of each item. 
Into each box we put one joint occurrence, from manifest observed 
data. The diagonal of the matrix is left open except for the left 
upper box where we put down the size of the sample, which will be 
1, if we think of the marginals as proportions. The whole idea will 
be quite obvious from the following scheme: 


1 Pr pe ps Mm 


P1 : Pi Dis Puc’: 
Pe Par : Das pu -** 
Ds Pa p32 : Du-°° 


It can be proved that the rank of this matrix depends upon the 
degree of the polynomial trace lines, from which the joint frequen- 
cies are supposed to have generated. If the degree of the trace lines 
is m, then the rank of this matrix will be (m +1). Any determinant 
in this matrix of order (m + 2) will vanish. 

The importance of this fact lies in its inversion. The matrx 
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which was just put down consists of empirical manifest data which 
-we can inspect. If it has a rank of, say, 4—if all determinants of 
order higher than 4 vanish (after reasonable error allowances have 
been made)—then we can assume that the trace lines are of degree 3, 
that they are cubic parabolas. Actually, the conditions are more 
complicated because a similar rule also applies to the matrices which 
can be formed from the higher joint occurrences, for instance, of 
three items at a time. The full extent of these conditions will be- 
come clear in the second section of this chapter. What matters at 
the moment is this: From an inspection of the manifest joint occur- 
rences as they come up in a concrete test, we can infer the degree of the 
polynomial trace lines with which we have to work in a concrete case.* 
It will turn out later that the study of joint occurrence matrices 
is a crucial step in the approach to testing developed here. For 
this introduction, it is enough to keep the following two points in 
mind: 
k. For a variety of reasons, it is helpful to assume that the trace lines are 
polynomials. 
1. The degree of the polynomials is determined by the actual data in a 


given problem. This degree can be derived from a study of the joint 
occurrence matrices. 


5. RELATION BETWEEN POLYNOMIAL TRACE LINES 
AND MANIFEST DATA 


The first step then in any empirical investigation is to make a 
decision as to the degree of polynomial trace lines one will want to 
work with. Because of the fallibility of actual data, this will always 
be somewhat arbitrary, and we will come back to this point in the 
next chapter. At the moment, the question to be discussed is this: 
What do the joint frequencies tell us about the coefficients of the 
trace lines and the nature of the underlying distribution curve? 
To clarify the whole trend of thought, we shall discuss here the 
case of linear trace lines. We assume then that the joint frequen- 
cies of all order satisfy the necessary conditions and especially that 
the matrix of paired joint frequencies, which was exemplified above, 
has the rank 2. The following accounting equations then link the 

3 We are leaving out from this general discussion certain degenerate cases which are 
of considerable practical importance but which do not add anything to the general idea. 
Suppose, for instance, that all trace lines are cubic parabolas but all have one coefficient 
missing. Their formulae then would be y = a + cz* + dz* and the rank of the matrix 


would be 3 and not 4. The implications of such a possibility will be briefly referred to 
in the next section of this chapter. 
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manifest data to the coefficients of the linear trace lines and the 
moments of the distribution function (the use of the summation 
symbol is obvious): 


(14a) pp =a;+bM™ 
(1.4b) py = asa; + (2b; + a,))M% + 60,M® 


(1.40) pie = 1004 + Yeam a Yao + b,bjb.M® 


From (1.4a) and (1.4b) we deduce that :‘ 
(1.5) Cig] = pis — pops = bd; {M® — M%} = b,bj0? 


where o? = M® — M“”, the variance of the distribution curve 
(x) around its mean M® = M. 
If we take three items 7, 7, k we can deduce from (1.5) that 


(1.6) — a = & 


This introduces the symbol S; = ie 
J 


Solving (1.4a) for a,, the equations of the trace lines then read 
=y. = (>, — MS) 4 & 
(1.78) fila) = = (pe — MS) 4 Si 


The coefficients of the trace lines are thus determined by the 
manifest data (p; and S,) and the two lowest moments of the distri- 
bution function (M and ¢). 

The implications of the result are clarified if one attempts to 
graph the trace lines on the assumption that the manifest data are 
given from an actual test. Obviously, this cannot be done, because 
we do not know the first two moments of the distribution curve. 


4 The symbol [ij] to designate the “cross product” of a fourfold table will, in the next 
section, turn out to be the special case of a more general symbolism. There it will be 
applied to tables where the number of cases is not equal to 1. In the present pre- 
liminary discussion, where we deal with frequency proportions only, no distinction need 


be made between [ij] pertaining to actual frequencies and fl the standardized cross 


product. The distinction will appear, however, in the next section. A similar distinc- 
tion will be necessary for the magnitude S; to be introduced in equation (1.6). 
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Suppose, however, we graph one of the trace lines arbitrarily, say, 
through the origin and at an angle of 45 degrees with the z-axis. 
Then, relative to this one line, all the others are perfectly fixed. 
As can be seen from equation (1.7a) the first choice determines im- 
plicitly the two moments and thereafter the position and direction 
of all the other lines is determined, too. Another procedure would 
be to draw our (z,y) system fixing the average of the distribution 
curve at 0 and its standard deviation at 1. Then, again, the equa- 
tions of the trace lines would be perfectly determined as follows: 


(1.7b) fiz) = yi = pit Sea 


Before pursuing further this point, another consequence of equations 
(1.4) should be noticed. From (1.4c) we can compute the third 
moment, which turns out to be: 


(1.8) M® = o°K + M(M? + 30%) 


= Pis— Pipsps_ (21 , Pr, Ps 

TE BBS (3 tats 

It is obvious that any further higher moments can be computed 
merely by adding additional accounting equations after (1.4c), 
starting with the joint occurrences for four items. 

If the trace lines are of higher order, then the number of moments 
to be introduced rises rapidly. A manifest frequency p., in the 
case of a cubic trace line, for example, would be represented by an 
equation of the type of (1.4c) where on the right (latent) side a 
ninth moment appears. It seems that then more than just the 
first two moments have to remain undetermined. The matter is 
still under investigation. For the time being we have to restrict 
ourselves to the following additions to our summary: 


m. For polynomial trace lines the accounting equations link the manifest 
joint frequencies with the coefficients of the trace lines and the mo- 
ments of the distribution function. 

n. If the trace lines are linear the whole latent structure is determined 
but for a linear transformation. 

o. This means that, for linear trace lines, we can compute the coefficients 
of the trace lines and any number of moments as long as we add fur- 
ther items to our test. Both coefficients and moments will be ex- 
pressed as functions of the manifest data and the first two moments 
of d(z). 

p. In the case of higher degree polynomials more moments seem to re- 
main undetermined. This general case has not yet been fully ex- 
plored. 
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6. THE EQUIVALENT LATENT STRUCTURE 


The results of the preceding paragraph show that we are not yet 
in the stage of performing actual computations. The introduction 
of polynomial trace lines led from the very general equations (1.2) 
to the much more specific equations (1.4); but, obviously, some fur- 
ther simplification has to be introduced if we want to get account- 
ing equations where everything is determined. This step can be 
taken in a variety of directions. We could, for instance, establish 
how many unused independent accounting equations are left in 
the model after we have fixed the degree of the polynomial trace 
lines to be tested. Then we could introduce the most general dis- 
tribution function with the given number of parameters. The 
accounting equations then could be solved for the coefficients of the 
trace lines and the parameters of $(z). 

In the present text one specific solution is followed up which 
corresponds to the assumption that ¢(x) is a point distribution 
with (m + 1) discrete points; if the total population is assumed to 
be 1, then its proportions concentrated at these (m + 1) points are 
the m parameters of the distribution curve which are required for 
optimum results. Such a specification of ¢(z) is tantamount to 
assuming a@ partitioning of the whole population into a number of sub- 
classes. Each of these subclasses is assumed to be concentrated at 
a different point on the continuum. For intuitive illustration one 
might think of the way some writers talk about social classes. Im- 
plied in some social theories is the idea that society is stratified into 
a number of groups with rather distinct cleavages between them. 
One possible approximation to such an idea would be to assume 
that along some social stratification continuum the population has 
a discrete point distribution. The mathematics of such a model 
of segmentalization will be the main topic for the rest of this chapter. 

Under the assumption of a discrete distribution there are no peo- 
ple outside a given number of points (class positions). The ac- 
counting equations, instead of being integrals, therefore become 
finite sums. The probabilities of answering “yes” to a number of 
items would have to be introduced only for the points of concentra- 
tion. Let us, as an example, assume three points. Let there be 
V1, ¥11, and yy proportions of the population concentrated at each 
point. Designate the latent probabilities pertaining to these three 
points by pr, pir, and pir respectively. Then the manifest margi- 
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nals of two items would be related to these latent parameters by 
the following equations: 


Pi = Vipn + Pupin + Yinpmn 
(1.9) pe = Vip + Yupin + YimPri 
My tar + in = 1 


Equations (1.9) mean this: The probability of a positive answer for 
the whole sample is the weighted sum of the segmental probabilities. 
If we know how many people at each point say “yes” to an item, 
then we also know how many do so in the whole population. 

The trace lines in this simplified model could be looked upon in a 
variety of ways. In the most rigid sense one might have to say 
that they, too, consist only of discrete points. More loosely, one 
might consider them polygons connecting the three points. But 
for the sake of a more general and consistent algebra, it is preferable 
to continue to think of them as polynomials. In order to reduce 
the redundancy of such a model to a minimum the degree of the 
polynomial will be determined by the number of points. In the 
case of three classes, for instance, we have for each item three latent 
probabilities which would determine a polynomial of second degree, 
a quadratic parabola. In the case of (m + 1) points the degree of 
the polynomial would be m. 

Following exactly the same argument as before, we can then 
write down the general accounting equations for a “pure test’’: 


(1.10) pie. = Prpieprypies ss  Yupucpiypiee + + PrpuPiunspute: ** 


It is important to realize all the implications of equations (1.10). 
If we have, let us say, six items, the system would consist of 64 equa- 
tions. But there are only 18 segmental probabilities and 3 propor- 
tions for the segments of the population. Nevertheless, these 64 
equations for 21 unknowns do not lead to contradictions. This is 
due to the postulate that the trace lines are all quadratic parabolas 
and that, therefore, the manifest data satisfy certain conditions, one 
of which is that the joint occurrence matrix as discussed above 
would have the rank 3. 

Equations (1.10) have a form quite analogous to equations (1.2). 
It will be recalled that we multiplied the trace lines f;(z) and the 
distribution function ¢(z) and then performed an integration over 
the z-continuum. Now, we multiply at each point of the 
zx-continuum the segmental probabilities p, and the population 
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proportions vy, and then we perform a finite summation over all 
points. It is for this reason that these new equations can be 
called the equivalent finite accounting equations. 

We have not yet shown the conditions under which a partition 
exists which has the properties just outlined. This will be done in 
the next section of this chapter. At this point, we will take its 
existence for granted and consider what its implication will be for 
the progress of our discussion. The solution of equations (1.10) 
would give us the following information: 


(1) The coefficients of the polynomial trace lines. 


(2) The proportion of the sample which has to be assumed at each point 
of concentration. 


Both these results can be formulated in still another way. In- 
stead of (1) we can say that we would find the latent probabilities 
corresponding to the concentration points. These probabilities 
would be the ordinates of m points from which the polynomial trace 
lines, and therefore their coefficients, would be determined. In- 
stead of (2) we could say that we would know the segmentalization 
of the population or its discrete distribution. Without going into 
further detail, it might be added that this whole geometrical picture 
would be determined but for a linear transformation. The dis- 
tances between the points of concentration on the z-axis can be 
drawn arbitrarily. 

The segmental probabilities and the proportion of people at each 
point of the distribution curve we shall call latent parameters. 
The whole system of these latent parameters will be called the 
latent structure of the manifest test material. To the derivation, 
computation, and use of the latent structure the rest of our two 
chapters is devoted. We can now conclude our introductory sum- 
mary as follows: 


q. If the polynomial trace lines are of degree m, then a specially manage- 
able model can be developed by assuming that the population is con- 
centrated at (m + 1) points. 


r. In this case, finite accounting equations establish a relationship be- 
tween the manifest data and the underlying continuum on the same 
logical principle as do the general accounting equations. 


s. They do not, however, utilize the probabilities of positive response for 
each point on the z-continuum; they work with segmental probabilities 
as would exist for (m+ 1) points of the underlying continuum. 
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t. The ensuing equations are based on finite summations over all points 
and can be completely solved. (Equations (1.10).) 


u. The solutions of these equations are called the latent structure. This 
latent structure is a determinable equivalent of the original model 
with which we started. It gives for each test item the trace lines and 
(m + 1) proportions for the discrete distribution function. 


v. The parameters of the model are determined but for a linear trans- 
formation on the z-axis in as much as the distances between the points 
of concentration are arbitrary. 


In the present approach two groups of elements can easily be 
distinguished. On the one side we have manifest empirical data: 
the answers of people to test items, singly and in combinations of 
items of all order. On the other hand, we present a model whose 
existence can neither be proved nor disproved directly. We repre- 
sent the process of test building in terms of trace lines, claiming that 
whatever a test builder considers can be translated into more or less 
specific assumptions as to trace lines. We also claim that the idea 
of measuring an underlying continuum with the help of an itemized 
test can be fully expressed by certain accounting equations. Since 
such a “formalization” of the process of test building is a postula- 
tion not subject to direct verification, it can be judged only in terms 
of its reasonableness and ultimate usefulness. 

Any specific model, however, can be tested. Its parameters will 
be related by definite equations to the manifest data, on the one 
hand, and to the general model—the trace lines and the distribution 
function—on the other hand. These relationships are of a mathe- 
matical nature and can be checked by any expert. One such specific 
case is the latent class structure, which was described in the last 
paragraph and which will now be explored further in the next sec- 
tions. In the end we shall come back briefly to the many other 
types of specifications which could be introduced into the general 
model. 


SECTION II 
THE RELATION BETWEEN MANIFEST 
DATA AND LATENT PARAMETERS 
1. THE DICHOTOMOUS SYSTEM 


In Section I we have presented the general considerations which 
lead to the notion of a latent structure and its relation to the anal- 
ysis of itemized tests. The main idea is simple. A population is 


880 LATENT STRUCTURE ANALYSBIS 


considered divided into a number of latent classes. The answers 
to test items given by people within one latent class are unrelated. 
But from class to class the proportion of answers to the same item 
varies. As a consequence, the answers to two or more items for the 
whole sample will appear related. It is a generalization of a well- 
known fact exemplified by the tries number scheme: 


Es 
Es 16 


16 4 4 20 24 20 24 


In the first two boxes the answers to two dichotomous items are 
unrelated: whether people are positive or negative on one item they 
have the same ratio of positive and negative replies on the second 
item. But when the two tables are added up box by box, we get 
the table on the right and here a relationship appears: people who 
give 8 positive reply to one item are also more likely to give a posi- 
tive reply to the other. 

Inversely, one can start with the table on the right side and look 
for ways to find the two tables on the left. For many items and 
several latent classes this is what latent structure analysis intends 
to do. 

In the present chapter this problem will be treated in a general 
way. Therefore one need not think of the latent classes as ordered 
am any way nor as employing the segments of an underlying con- 
tinuum. The more general approach will be connected with test 
problems in Section VI. There it will be seen that the general 
solutions to be given now permit us to go beyond the program of 
the preceding section and to deal with several underlying continua. 
In the present section stress will be laid not only on solutions 
achieved but also on work still to be done and especially on aspects 
of the mathematical problem which are not yet clarified. 

Our starting point will always be a set, S, of m dichotomous items 
such as a series of true-false questions or of attributes an individual 
might or might not have. As will be shown later, the whole ap- 
proach can be immediately applied to items which come in more 
than two classes, for instance, trichotomies.® 

® The letter S here is an abbreviation of the term “system.” It should not be con- 
fused with the symbol S; which has appeared in Section I as a computational magnitude 


formed from manifest data. This magnitude will reappear again in Section III. The 
discussion of the dichotomous system S is restricted to the present Section IT. 
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The m dichotomies permit 2" combinations. We could repre- 
sent the system, S, by these 2” “response patterns.” 

But for the present purposes it is better to represent the system, 
S, by the joint positive frequencies of all orders. Let the m items be 
arranged in an arbitrary order and then give Arabic numbers to 
them. Let n be the size of the sample, n; be the number of people 
who say ‘‘yes’” to the first question, n, be the number of people 
who say ‘“‘yes”’ to the first two questions, etc. The number of in- 
dexes which such a joint positive frequency has will be called its 
order. The highest order which can occur is of course m and be- 
longs to the number of people who answer all items positively. The 
zero-order frequency is identical with the size of the sample. 

According to a well-known proposition in combinatorics,* the 
number of these joint frequencies is 2". There are, then, as many 
response patterns as there are joint positive frequencies. The re- 
sponse patterns are all of order m, but include positive as well as 
negative answers to each item. The positive joint frequencies are 
of all orders but include only positive replies. 

Any theory of scaling aims at giving some kind of order and 
mutual relationship to the 2" response patterns of a given set of 
items. The present theory has been formulated in such a general 
way that, as far as we can see, it covers all other procedures pro- 
posed so far and therefore permits us to establish the relationship 
between the different scaling systems now extant. While we shall 
return to response patterns later, it will be necessary first to discuss 
at some length the mathematical properties of the joint positive 
frequencies. 


2. THE REDUCTION OF A DICHOTOMOUS SYSTEM 


The relative positive frequencies for a series of dichotomies will 
be indicated by the letter p with an Arabic numeral subscript indi- 
cating to which dichotomous item it refers; relative negative fre- 
quencies will likewise be indicated by the letter g. We have 
therefore the following symbolic conventions: 


nm Ne 
nr =P, n = Pr, ete. 


a=1—p, g=1-— pr, ete. 
According to the usage of empirical social research, these will be 
called relative marginals. 


*See G. U. Yule and M. J. Kendall, An Introduction to the Theory of Statistics (12th 
ed., J. B. Lippincott Co., Philadelphia, 1940), page 17. 


882 LATENT STRUCTURE ANALYSISB 


Definition: A homogeneous dichotomous system is one where the 
higher order frequencies can be computed if the size of the sample 
and the relative marginals (the p,’s and the q,’s) are known. The 
procedure of computation is evident from the following formulae: 


or = Pips, a = Pipspra, == = PrPa, etc. 

This means nothing else, of course, but that in a homogeneous S 
the items are independent of each other. In terms of probabilities, 
the higher order probabilities are the product of the first order proba- 
bilities. 

In empirical research, when a system of items is unrelated in this 
way there is, so to speak, nothing further to explain. If items are 
related, then explanation usually consists of introducing additional 
items so that for certain subsamples the relationships disappear. 
This well-known procedure of successive cross tabulations is for- 
malized here by the following definition: 

Definition: A dichotomous S of m items is called reducible into \ 
strata if the following conditions prevail : 


(a) There exist \ homogeneous systems of m items. 


(b) A unique correspondence is established between each frequency in the 
original system and one frequency in each of the homogeneous sys- 
ten 


(c) By adding corresponding items in the homogeneous systems we obtain 
the corresponding item in the original system. 


The relationship between a reducible system, S, and its com- 
ponent strata will be symbolized by the following equation: 


S=8;+S8n+-:-:- +S; 


To illustrate this definition we present in Table 1 a simple ex- 
ample where m = 4 and X = 2. The systems are given in two 
forms: (A) through their sixteen response patterns and (B) through 
their sixteen joint positive frequencies. The arrangement of the 
scheme makes clear the kind of correspondence which exists be- 
tween the various systems. It is worth remembering that the fre- 
quency Mx is equal to that for response pattern + + + +, and 
for example my — nwy gives the frequency of response pattern 
=F se es 


™See Yule and Kendall, op.cit., page 19, for rules of computation. 
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The frequencies in the homogeneous systems we call “latent” and 
the original frequencies ‘“manifest.’”” The reader should satisfy 
himself on two counts. The sum of corresponding latent frequen- 
cies is the corresponding manifest frequency. And the two latent 
systems are homogeneous; that is, the higher order frequencies are 
the products of the first order frequencies. From part B of Table 1, 
the reader may check the relation between ni and nim. In the 
second and third columns (in the two latent subclasses) mz: = 


TABLE 1 


A Repucrion ror Tox Casn m = 4, X = 2 


A. IN TERMS OF RESPONSE PATTERNS 


1 2 8 4 Total Latent class I Latent class [I 
+ + + + 72.2 71.9 3 
+ + + - 72.8 68.4 4.4 
+ + - + 56.8 54.2 2.6 
+ =—- + + 44.0 42.7 1.3 
- + + + 6.3 5.9 4 
+ + - - 90.8 51.6 39.2 
+ —- + - 59.7 40.6 19.1 
+ - - + 43.7 32.2 11.5 
- + + - 11.4 5.6 5.8 
- + - + 8.0 4.5 3.5 
SiS = §.2 3.5 1.7 
+ - - = 201.0 30.6 170.4 
- + - - 55.7 4.2 61.5 
- - + - 28.4 3.3 25.1 
- - -— + 17.8 2.6 15.2 
- - - = 226.2 2.5 223.7 

B. IN TERMS OF JOINT FREQUENCIES 
n 1,000 424.3 575.7 
nm 641 392.2 248.8 
ne 374 266.3 107.7 
™ 300 241.9 58.1 
nm 254 217.5 36.5 
Tas 292.6 246.1 46.5 
nu 248.7 223.6 25.1 
nua 216.7 201.0 15.7 
Na 162.7 151.8 10.9 
na 143.3 136.5 6.8 
nu 127.7 124.0 3.7 
N23 145.0 140.3 4.7 
Tin 129.0 126.1 2.9 
nin 116.2 114.6 1.6 
Tm 78.5 77.8 7 
Tis 72.2 71.9 3 
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“ne. But in the first column (representing the manifest data for 


the whole sample) this is not the case. Still, row by row, the first 
column is the sum of the other two. 

A remark is in order on the similarity between this procedure of 
reduction and factor analysis. Habitually we say that test scores 
are the linear sum of the factors. But we can also say that the cor- 
relation between scores is zero when the factors are partialled out. 
The latent systems in our present scheme can be looked upon as 
relationships between items which have a “zero correlation” if the 
latent structure is partialled out. We shall come back to this inter- 
pretation in considerable detail. At the moment, however, we still 
remain in the realm of algebra, and raise the following question: 
Under what conditions can a given system, S, of m items be reduced 
to \ homogeneous subsystems? 


3. THE LATENT STRUCTURE 


Let us assume that a reduction has successfully been carried out 
for a system of \ = 3 and m = 6. Then we would have the follow- 
ing information: 


(a) We would know the number of cases in each of the three subclasses: 
the latent class frequencies. 


(b) We would know for each subclass the six “latent marginals,” that is, 
the proportion of people who give a positive answer to each of the six 
items. 

(c) From these latent marginals we could get all the higher order fre- 
quencies in each subclass. 


(d) By adding up corresponding latent frequencies we could get the corre- 
sponding manifest frequencies. 


The latent structure is summarized in Table 2. The latent struc- 
ture equations shown in Table 3 are derived from this scheme by 
multiplications within each row and additions over the rows in 
Table 2. On the left of the latent structure equations are the joint 
frequencies as they are observed from an actual test. On the right 
are latent parameters which are unknown to begin with and which 
we desire to compute with the help of the equations. 

In Table 2 the latent subsystems are put into an arbitrary order 
and indicated by Roman numerals. In the scheme each row corre- 
sponds to a subsystem and each column to an item. The relation 
between latent and manifest frequencies is given by 2° = 64 equa- 
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TABLE 2 


Larant Structure Fork m = 6 aND X = 3 


Latent class Latent marginals for ttems 
frequencies 1 2 3 4 6 6 
nr Pu P12 Prs Pu Pis pre 
"IT Pin pr Pls Pia Pits Pie 
RII Pun Pin pur pur Pills Pitts 
TABLE 3 
Tae Latenr Strucrurs Equations 
n= ny nin + nit = Yn 
m = npn + miupin + nuipim - Yaa 
Mm = nip + nupin + Rup = Yriva 
Me = mpupn + Rupinpue + MuPUUpPII2 = Years 
my = mpupr + nupiupius + NiurPunPiuUIs ol Yrvaps 
fun = UpPUPUPis + Mrpmipuapis + MUPUNPUIEPIIs = Mepapapa 
Ma = Mpupepu + AnpapPUepiiae + NUNPUMPULPIi6 = Dpapape 
pene ead Hee ee aoe oe hee = Srpupapapupapi 


tions. The reader is invited to familiarize himself with the scheme 
of Table 2 and 3, since it will be the basis for all further analysis. 

In general, the number of parameters in a latent structure is 
A(m + 1). In each of the subsystems we have m latent mar- 
ginals and to this we have to add the total number of cases or latent 
class frequencies, one for each subsystem. 

These A(m + 1) latent parameters are linked by 2” equations. 
A reduction, therefore, will in general not be possible if 2" > 
(m + 1) except if certain additional conditions are satisfied by the 
manifest frequencies so that the number of independent equations 
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is reduced to at least the number of latent parameters. In order 
to derive these conditions we have to discuss once more the presen- 
tation of the system S itself. 


4. THE JOINT FREQUENCY MATRICES 


In Table 4 we have a symmetric matrix where the left upper cor- 
ner stands for the size of the sample. The first row and the first 
column are filled by the manifest marginals of the items of the sys- 
tem. In the boxes outside the main diagonal we have the number 
of people who give positive answers to any pair of two items which 
can be formed. In the diagonal boxes we have symbols in pairs 
which so far have no meaning and are not defined in the system, S. 
They will hereafter be called the “missing frequencies.’’ We shall 
comment upon them presently. 


TABLE 4 


Tus Manirsest Jormvr OccurRencs Matrix M 


1 2 8 4 5 6 
n m ™ rs ™% ms 
1 mm (nun) m3 ne mu ny nies 
2 Mm ms (nm) Ts nm tn ne 
3 ™% ms Ts (122) cy Tes Tes 
4 ns nu nm nu (nu) ne nes 
5 Ns Ms ne nes ne (143) ne 
6 Ne nu Te tes nes Ts (res) 


Let us suppose now that we have stratified the sample by item 
1 and picked out all the people who give a positive answer to item 1. 
We then get the frequency with which these people give a positive 
reply to all possible pairs of remaining items. These frequencies 
properly inserted give M'. In this matrix the row and column of 
item 1 as well as the main diagonal will consist of missing frequen- 
cies. We shall call this a joint occurrence matrix of signature 
1. Correspondingly, we shall have such a matrix for any other 
“signature.” M™*, e.g., will originate in the following way: we first 
pick all the people who make positive responses to items 2 and 3; 
then we find by cross tabulations the number of these people who 
jointly make a positive response to the pairs of remaining items. 
This matrix has signature 23. These joint occurrences of higher 
order are then arranged in a way exemplified by Table 5. 
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The system S can be presented by the totality of such matrices. 
This presentation is redundant. Some of the manifest frequencies 
in the first line of M®, e.g., also appear in M*, in M™, in M*®, etc. 
If the M-matrices are arranged in lexical order, the rule is this: The 
rows and columns corresponding to the signature of M consist of 
missing frequencies only. All “preceding” rows and columns have 
already appeared in a “previous” M of the same order but of an 
“earlier” signature. 


TABLE 5 
Tus Joint OccuRRENcE Marsix M™ 


1 2 8 4 6 6 
Ts tun (res) (fess) Tau Tess fm 
1 Tiss (musa) (russ) (nszsz) Nise nis Rass 
2 (rasa) (russ) (azz) (112333) (112334) (71333) (rem) 
4 Team ™en (rsa) (102334) (nam) aus Tams 
5 Ts fins (Co) (eass8) fms (Mea06) fans 
6 ns Mise (ree) (resass) Nae flame (1rasae) 


In spite of this redundance, the joint occurrence matrices will 
turn out to be a powerful tool of analysis when we want to study 
the reduction of S in more detail. 


5. THE RELATION BETWEEN THE LATENT STRUCTURE AND THE 
MATRICES OF MANIFEST JOINT FREQUENCIES 


Let (N) be a diagonal matrix the elements of which are the latent 
class frequencies. For \ = 3, for instance, we have 


ny 0 0 
(NY) = 1/0 mr 0 
0 0 nit 


Let L be a matrix the elements of which are the latent marginals 
bordered by a column of unities. For m = 6, \ = 3, we have 


1 pu pis pis pu PIs pis 
1 pin pits Pus pr pus puts 
1 pri piris Pits pir PIs Piiie 


fez 


The matrix L has \ rows and (m + 1) columns in the general case. 
It is practically a replica of the latent structure scheme. 
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Finally, we shall, by the usual convention, designate by L’ the 
transposed matrix L. Then some of the equations of Table 3 can 
be put in the following form: 


M = L'(N)L 


(The reader in checking this formula will remember that premulti- 
plying a matrix by a diagonal matrix means to multiply each column 
of the matrix by the element in the corresponding column of the 
diagonal matrix.) 

New N-matrices of higher signature will be introduced by the fol- 
lowing types of defining equations: 


nipn 0 0 
(N}) = 0 niupin1 0 | 
0 0 nurpun 
Nupuprs 0 0 
(N!) = 0 Mupinpr12 0 ete. 
0 0 AIUIPUNP112 


They are all diagonal matrices. Their diagonal elements each 
correspond successively to one of the latent subsystems. Their 
“signature” indicates how many and which items are involved in 
the formation of the matrix. Each element is the product of a 
latent class frequency and an indicated number of latent marginals. 
The element in the third row of (N™*), e.g., will be niuprrePrreP 1118: 

It then can easily be seen that equations of the following type 
hold true: 

M = L'(N)L 
M® = L'(N™)L 
etc. 


All the equations in Table 3, which link the latent structure and 
the manifest positive frequencies in the system S, can therefore be 
summarized in the basic matrix equation of latent structure analysis: 


M=L(N)L  o¢ =0,1,2,3, ---, i, 12, 13, --- 


where o is the signature of the manifest M-matrix and the latent 
N-matrices, and M°® = M, of course. 
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6. THE BASIC DETERMINANTS AND THEIR PROPERTIES 


From the basic matrix equation we can derive a variety of more 
specific equations by forming what will be called basic determinants 
of the M-matrices. We shall call a basic determinant of the matrix 
M any determinant which includes the element n and some other 
elements of the first row and of the first column, but no element of 
the main diagonal (other than the element n). Such a determinant 
is formed by picking certain additional rows and columns. Once 
they are indicated, the whole determinant is known. Therefore, 
the following symbolism is self-explanatory. The first group of 
Arabic figures indicates the additional columns selected, and the 
second group the additional rows. Thus: 


n ™ Ns 


= [1,2] , |m ms ms] = (25,13) 


| Na Nie 
ng Nas nes 


If basic determinants are formed from an M-matrix with higher 
signature than zero, this will be indicated after a semicolon. From 
M?, for instance we could form 


Uc) 
M5 nas Nas) 
ns nae Naas} 


[14,56;2] = 


The basic matrix equations permit us to express these basic 
determinants in terms of latent parameters. These expressions 
have properties which can either be proved directly or derived from 
a general theorem.® 

For our purpose, it is best to single out five properties. They 
will be formulated in general terms and exemplified for the case 
A = 3. 

(a) Basic determinants of order higher than d will always vanish. 
For instance, 


A.C. Aitken, Determinants and Matrices (Oliver and Boyd, Ltd., Edinburgh, 1944), 
page 85. 
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n ny NR Ns 


(2.1) [123,456] =o|™ ™™ ™™ mm 
Me Mie Ne Na 


rie Ne Ne 


nN 
Ma Meee Yaga Yawa 
Vree Yaga Yapapa Yinpapa 
: Mowe Mrwapa Mewapa Drwapa 
Yrea Vrwape Drape Vrwapa 


1 1 0 1 pn pr Dis 
pu Pus pun O} {1 pin Piz pus 
Pis Pils PIs 0 1 Pun Prur12 Pris 
pis Pie Pine O 0 0 0 0 


= NyNiwN111 


The first part of (2.1) may be derived from the scheme of Table 3; 
the second part is a simple application of matrix multiplication. 

(b) Basic determinants of order \ can be presented as the product of 
three other determinants of order d, each one taken respectively and 
correspondingly from one of the three matrices L’,N, L. For instance, 


nm N% Ma Yirwa dina 


[12,34] = Ine ms mg! = direa Yrwapa Yreapa 


re Nhu Na dire Dirwape Dirwapa 


1 1 1 1 pn Piz 
(2.2) =| Pus Pius| Nunn |1 pm Prime 
pu pu pur 1 pun pin 


If the M-matrix is of higher signature, this will show up in the 
product which corresponds to the N-matrix. For instance, 


1 1 1 1 pn Dr 
(1 2,34; 5] = /)Pis Pirs Pris | MP uN UPN 11I1P 111s 1 pm pir 
Pu Pus Pur 1 pun put 


It is important to realize that this will be true only for basic 
determinants of order A, when ) is the number of latent subclasses. 
(c) The ratio of two basic determinants of order taken corre- 
spondingly from a matrix of higher signature M° and the matrix M 
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depends only upon the items kept constant for the formation of M° and 
not upon the items involved in the formation of the determinants. 
From the preceding example we see immediately that 


(2.3) [1.2, 3 4; 5] 


[12,34] — PuPrepuis 


Because of the importance of this ratio for practical computa- 
tions, we shall give it a special symbol y;, where the index refers to 
the item whose latent marginals are p;,pn.Prn:}; in this case we have, 
for instance, us = DysPusPrus- 

In the same way we would find, e.g., that 


(2.3b) [ab, cd; 23] 


(ab, cd] = PePiaPrui2PisPriisPir1a = Les 
? 


where a, b, c, and d indicate any items other than 2 or 3. 

The ratio between such corresponding determinants of order \ 
will be called their link. It has great importance for practical com- 
putations as well as for the whole theory. 

(d) Links are multiplicative in the sense that 


(2.4a) Mik... = MifljMe ee 


This can be seen immediately from the formula just developed. The 
result can be expressed in still another way. We can see that, e.g., 


Mik _ 

(2.4b) ioe 
This means that the result (c) can be generalized in the following 
way: The ratio of two basic determinants of order \ taken corre- 
spondingly from two M-matrices of different signature depends 
only upon the signature of the two matrices and not upon the items 
involved in the formation of the determinants. 

It will help to clarify the meaning of (2.3) and (2.4) if they are 
exemplified by the simple case \ = 2, and m = 5. According to 
(2.3) we have, e.g. 


[12;5] _ [34;5] 
[12] [34] 


This means we stratify a sample according to its answer to item 5. 
Then the “relationships” between items 1 and 2 might be lower than 
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they were for the whole sample. This “drop” is expressed by the 
ratio Hs = 12 5], Keeping item 5 constant also introduces a 


[12] 
“ 8 [34 55] : “ 
drop”’ in all other cross products, e.g., “Tay What item 5 “takes 
out”’ of the relationships between any two other items depends upon 
the “stratifier” (item 5) only and is the same for all ratios £1935], 


[ 1 _ [1254] | [1255] zs 
12;45] _ [12;4 1255] i yo 
fi2] ~ [12] 4 [12] The “drop” in 
duced into a relationship [12] by two simultaneous stratifications 
(according to items 4 and 5) is the product of the “drops” induced 
by the two stratifications induced independently from each other. 


In its alternative form, (2.4) indicates that: 


[12;45] _ [12;5) 
[12;4] [12] 


(2.4) states that 


This means that stratification by item 5 induces a drop in the rela- 
tionship between items 1 and 2 which is the same, irrespective of the 
level of stratification performed previously with the help of other 
items. In a somewhat more general formulation: 


[iisk05] _ [iiik5] _ [iis5] 
ligskt]) [3k] [J] 


(e) Determinants of orders less than d take a somewhat more com- 
plicated form. Just for the record the following example is added 
for the case A = 3. 


n 1 1 1 Pu 
(2.5a) [1,2] = = mn 
TM Ty Pi. pr 1 pm 
1 1 1 pu 1 1 1 pm 
+nyni1 . + rim : 
Pi Pur 1 pun pris PII 1 pum 
If \ = 2 this reduces to 
(2.5b) [ij] = nya (pri — pris) (pry — Pus) 


The formulae (2.5) which give the ordinary cross product in 
terms of the latent parameters will turn out to be especially useful 
for computational purposes. 
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7. CONDITIONS FOR REDUCIBILITY 


It is now simple to formulate the properties which a system, S, 
has to have in order to be reducible to a “sum” of \ homogeneous 
systems. 

(a) The rank of all joint occurrence matrices has to be X. From 
proposition (a) in the previous paragraph, it is clear that the rank 
cannot be larger than A; in a variety of ways, preferably from the 
definition of “links,” it can be seen that the rank cannot be smaller 
than X. The product of the latent marginals would become indefi- 
nite if all basic determinants of order \ were zero. A similar con- 
clusion can be drawn from the actual computation of the latent 
parameters, to be discussed presently. 

(b) The ratio between corresponding basic determinants of order 
taken from two matrices M’ and M should have the same value, 
whatever items within M”’ are used to form these ratios. Or, in 
other words, “‘links’’ should depend only upon the signature of a strati- 
fied joint occurrence matrix. 

(c) Links should be multiplicative in the sense indicated in the 
previous paragraph under finding (d). 

There are then essentially three types of conditions which a 
reducible system has to fulfill. One type (a) takes place within all 
M-matrices. The second type (b) takes place between any matrix 
of higher signature and M, the matrix of joint paired occurrences. 
The third type (c) takes place between M and two or more matrices 
of higher signature. Not all of these conditions are independent. 
It would probably be desirable to show generally that the number 
of independent conditions is 2" — (m+ 1)A. This is a rather 
laborious problem of combinatorics. It has been carried through 
for the specific cases of \ = 2 and } = 3, but not yet in the general 
case. 

For many purposes it is useful to give the conditions of reduci- 
bility in a somewhat different form, by introducing cross-product 
matrices. ‘Take any basic determinant formed from the matrix M 
for instance: 


nm Te Ns 


(2.6) | eel O94 86] 
re Ms Ms Nes 


Ne Nie Nee Nee 


By pivotal condensation around the left upper element we can 
give this determinant a new form: 
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[1,4] [2,4] [8,4] 


(1,5) [2,5] (8, 5) 
[1,6] [2,6] [3,6] 


(2.6b) , 


where the symbols [t,j] are the well-known cross products of all the 
fourfold tables which can be formed between the positive and nega- 
tive occurrences of any pair of items. 

Now this form has an analogy to a correlation matrix. Its rank 
is obviously one less than the original form. The tetrads in this 
new form are third order determinants in the original form. For 
instance, 


nm mT 
(2.6c) [1,4] [2,5] = [1,5] [2,4] =n ae ae =n (1 2,4 5] 


Mm Nis Ns 


These tetrads will vanish in the case where \ = 2. In other words, 
instead of using the original matrix M we could use its condensed 
form, the cross-product matrix. Its rank would ber = A — lina 
system which is reducible to a “sum” of \ homogeneous systems. 

At this point let us return to the ‘‘missing frequencies.’”? Symbols 
like n,; were originally not defined. But from the general matrix 
formulation at the end of paragraph 5 in this section, a definition 
enforces itself. In the case of three latent subclasses, for instance, 
it would have to be as follows: 


Ne = Nipre + Nupud + niuPind 


This equation is not an empty formalism. Once the latent param- 
eters on its right side are computed from manifest data, we can then 
compute the numerical values of the missing frequencies. An in- 
tuitive interpretation comes close to the notion of reliability: the 
number of people who in two successive interviews would answer 
the same question twice in a positive way; this assumes, of course, 
that the whole test is truly “pure” and the repeated answers are 
determined only by the class position of the respondents and by 
otherwise random elements. It might be added that from a further 
analysis of these missing frequencies are derived additional condi- 
tions of reducibility of a rather complicated nature. Because they 
will not be needed in our subsequent discussion, they will not be 
considered further here. 
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It is important to see the conditions of reducibility against the 
background of the whole theory. We started out in Section I with 
a model, which consisted of a trace line for each item and a distri- 
bution function for the whole population. We found that from the 
type of manifest data which come from an itemized test, the coeffi- 
cients of the model cannot be mathematically deduced without fur- 
ther assumptions. Among the various specifications which are 
conceivable we picked the one whose algebra was developed in the 
preceding pages: the latent structure which was equivalent to a 
concentration of the population at » points and polynomial trace 
lines of degree (A — 1). 

The problem, therefore, shifted to the computation of the latent 
structure from manifest data. To solve this task, we proceeded as 
follows: We postulated latent structures with general parameters, 
and inquired as to the manifest data which such structures would 
generate. It turned out that a given latent structure imposes cer- 
tain conditions on the manifest data which could correspond to it. 
These restrictions were analyzed in the preceding paragraph. 

Then we reversed the process. We asked how we could recognize, 
from an inspection of manifest data, which corresponding latent 
structure might, as it were, have generated it. Thus we came to 
develop the present conditions of reducibility. But one point has 
still to be brought out. A latent structure can lead to only one 
unique set of manifest data. But the reverse is not true. A given 
set of data can at least theoretically come from different latent struc- 
tures. This is due to the possibility of ‘“degenerate’’ latent struc- 
tures. We can easily imagine a structure of \ subclasses, where the 
latent marginals are subject to certain conditions. Such restric- 
tions on the latent structure will express themselves as additional 
conditions imposed upon the manifest data: additional in the sense 
of going beyond the general ones developed in the preceding para- 
graph. The effect will usually be that certain additional determi- 
nants in the manifest joint occurrence matrices vanish. In certain 
cases, these latent restrictions themselves can be computed from the 
manifest data. In others, this is not possible. A latent structure 
with Ad classes without restriction on the marginals might generate 
the same manifest data as does a structure with more than 
classes but with conditions imposed upon the latent marginals. 
This leads to the distinction of “partially degenerate’ and “fully 
degenerate’’ structures. We shall give here one example of the 
second possibility. The first possibility, that the latent restrictions 


396 LATENT STRUCTURE ANALYSIS 


can themselves be discovered from the manifest data, will be 
exemplified presently. 

Take three latent subclasses. Suppose that all items have the 
same latent marginals in two subclasses, e.g. 


pis = pir 


Then it can easily be seen that all the conditions are satisfied which 
would lead one to assume that we are dealing with a latent dichot- 
omy. All the tetrads, for instance, will vanish. This can be seen 
if one computes a cross product by using formula (2.5a) in the form 


(2.7) [47] = nyu (pus — puss) (pry — pry) + nym (puis — Pur) (pry — pri) 
+ num (pik — pin) (pry — Pry) 


By assumption the latent marginals in class I and II are alike for all 
items. Therefore on the right of the last equation the first element 
vanishes and the sum of the second two becomes: 


[9] = (pis — piri) (pi; as puj) Mi (mr + m1) 


As can easily be seen therefore any tetrad formed from four items 
would vanish: 


[9] [tk] 
(4) fk) 


In a similar way it can be shown that all other conditions of re- 
ducibility to two latent classes would be satisfied. The existence 
of three subclasses could be detected only by adding further items 
which do not have the restrictive property. Inversely, therefore: 
If the conditions of reducibility to subclasses are fulfilled, the 
latent structure is either a general one for > subclasses or a fully 
degenerate one for more than A subclasses. Which of the two 
assumptions should be made cannot be decided on mathematical 
grounds alone. However, if the latter assumption is made, then 
it is not possible to compute all the latent parameters which a latent 
structure with more than A subclasses would require: further em- 
pirical material would be needed for this computation. It is obvious 
how these findings could be generalized to cases with X > 3. Not 
much would be gained if one were to develop the notation needed 
in order to cover all possible ‘“‘degenerate’”’ cases. Some of them 
will come up presently. 
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SECTION III 


THE DERIVATION OF SPECIFIC LATENT STRUCTURES 
1. THE GENERAL PROBLEM 


The solution of the latent structure equations can be a laborious 
task, which becomes more difficult as the number of subclasses in- 
creases. No general solution has yet been found, and numerical 
procedures have been developed for special cases only. The method 
of attack may be exemplified by the case = 3 andm = 4. In this 
chapter it will be assumed that the conditions of reducibility are 
perfectly fulfilled. In the next chapter we shall deal with formulae 
for handling actual data, where these conditions are only approxi- 
mately fulfilled. 

Consider, illustratively, the latent structure of Table 6, where the 
four test items are indicated by Arabic numbers and the three latent 
subclasses by Roman numbers. 


TABLE 6 
GeneraL Latent Structure ror m = 4, = 3 


Latent class Latent marginals for items 

frequencies I 2 8 4 
ny a+ Gs + os as+cs Otc 
ny ath az + bs a: + bs aa + & 
RII a a 5) a& 


The latent marginals are now given in a different notation, which 
for practical applications has two advantages. It dispenses with 
double indices and puts differences of latent marginals in a simpler 
form: 


Pris — Pires = Cy 
pis — pur = 0; 
Pu — Pi = bs 


These differences become important because our departure will be 
formula (2.7). The cross product between two items then be- 
comes : 


(3.1) [17] = nrnirese; + miunmibidy + mini (e; — 04) (e; — 8;) = 
cen (n — nx) + Ddynn (n — 11) — mim (e5b; + c;b;) 
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There are six equations of this type. In addition, we have four 
linear equations which permit us to compute a; in terms of b; and c¢,. 


(3.2) ns = my (a; + c;) + mu (a; + Bs) + nina; = na; + mre; + nid: 


We have now introduced ten of the 2 = 16 available latent struc- 
ture equations. Of the remaining, one is always: 


n= m+ m1 + m1 


We have not yet utilized the four joint occurrences of three items at 
a time, nor Moa. 

In regard to the former, it is again useful to introduce them 
through cross products, this time of higher signature. From (2.1) 
and (2.5) we can adapt (3.1) to the following kind of equation: 


(3.3) [1234] = nz (a4 + 4) nir1aqerce 
+ m1 (Ge + 04) nir1adbibs 
+ mx (a4 + C4) m1 (Ge + by) (1 — Bi) (C2 — a) 


It can be shown that for each triplet of items only one such equa- 
tion contains independent information. 

From (3.2) we can express a, in terms of b, and cq, so that only 
terms in b; and c; appear in (8.3) for all the items involved. In 
other words, there is a simple way to construct a higher order from 
a lower order cross product. We just insert, every time that a 
latent class frequency appears in a formula, the corresponding 
latent marginal of the stratifying item. 

The latent structure scheme of the present example shows that 
we have to find (4-3) + 3 = 15 latent parameters but have sixteen 
equations. In the mathematically perfect case, there has therefore 
to exist a condition between the manifest data. It is easily verified 
that the condition for \ = 3 and m = 4 is: 


ne™Mm %T Mz 
(3.4) beanie inieiade eT 


My Ne Na Nm 
ay Nts Nem Nis 


This means that we do not have to consider mis any further; it is 
dependent upon the other fifteen manifest frequencies. Therefore, 
it is not necessary to consider a sixteenth equation. 

The problem now is to solve the ten equations (3.1) and (3.3) in 
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the ten unknowns }b,, c; (¢ = 1,2,3,4) and nm; and my. Equations 
(3.1) can be made considerably simpler. We now introduce a new 
set of unknowns with the following transformation : 


Y,=} nn (n — m1 — Mn) 


n(n — nx) 


Ce 


This transforms equation (3.1) into 


(3.5) 


(3.6) WW) yy, + 24, 


For 4 test items this gives 6 equations in 8 unknowns. 

It is not difficult to express the other 6 unknowns in terms of say, 
Y, and Z,, from (3.6). But then we have to apply (3.5) to (3.3) to 
get 4 equations in by, c, nm: and mix. This is the algebraic labor 
which has not yet been performed for the general case. But the 
general idea can be inducted from the case of \ = 2, and for specific 
cases of ) > 2, where the complete solution is relatively simple. 


2. THE COMPLETE SOLUTION FOR A = 2 
Here the latent marginals are a; + 6; and a, respectively. Equa- 
tion (3.1) then becomes: 


[17] = nrnixb3d; 


ia 


and the transformation (3.5) reduces to Y; = b; and Z; = 0. 


Equations (3.6) are then tal = Y,Y;. They can be solved for any 


triplet of three items and give: Y,; = S,; where S,; = Ve. 
We thus have the difference between the two latent marginals ex- 
pressed in terms of the latent class frequencies: 


Si 


Tena 
n 


b; = 
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We take from (3.2) that 


pe = eat Mb, 
so that 
(3.7a) a,=p:—S; J 
ni 
and 
(3.7) a: + bi = pe +S J 


Equation (3.3) in this case becomes: 
[jkjt] = nmr (a; + by) aadjdn 


Dividing by [jk] we get: 
= Wk) _ ; 
(3.8) Mi [3] = (a; + bs) a; 


[ar —g¢,. [me 
= (n+5: ai (>. S; ae 


us incidentally is a “link” in the sense of Section II, paragraph 6. 
Equation (3.8) can be multiplied and the terms collected by the 


powers of t = 4 [. This finally gives the equation: 
II 


(3.9a) f+ Kt-1=0 
where 

= St _ Pi Ms 
(3.9b) ies 8. + Sp. 


Equation (3.9a) permits us to compute the ratio ¢ = Ve 
II 


because K is known from manifest data (3.9b). By substituting ¢ 
in (3.7a) and (3.7b) the whole latent structure becomes known. 
From equations (3.7) it can be seen that we would have to take the 
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positive sign for this root of ¢, because the model requires that 
a;+b; > a. 

This sequence of computations suggests where the difficulties for 
a general solution arise and indicates in what kind of special cases 
they can be by-passed. Whenever it is possible to reduce equations 
(3.1) and (3.3) to fewer members, the algebra becomes more man- 
ageable. It becomes rather simple, if all members but one vanish. 
This is the case if there are only two distinct latent marginals for 
each item, even if \ > 2. It occurs in the “partially degenerate”’ 
structures discussed in paragraph 7 of Section IT. 

Fortunately, some of the structures which are most useful in prac- 
tical work are of just this character. We shall now discuss three 
such restricted structures, showing (a) how their existence can be 
inferred from empirical data and (b) how the latent parameters 
can be computed if the conditions of reducibility are perfectly ful- 
filled. In Chapter 11, we will give concrete examples and compu- 
tational procedures where these conditions are not quite met.® 


3. A RESTRICTED QUADRATIC CASE 


It is often convenient to think of a set of items as having a special 
kind of parabolic trace line. The probability of a positive answer is 


* Two ways to solve the general latent structure equations of paragraph 1 might seem 
promising. They do not work, however, and the reason for their failure is instructive. 
One might first raise the question why equations (3.6) should not be used for more than 4 
items and thus yield the basis for a simple solution. Now it is clear that for 6 items 
we get 15 equations, between which many relationships exist. We can form a variety 
of nonads between the cross products of [tj] which will vanish because of (2.1). There- 
fore we do not have enough independent equations left to solve for 12 unknowns. With 
5 items we have just 10 equations of type (3.6) and 10 unknowns. But unfortunately 
even here a rather complicated relationship exists between the ten cross products, as 
was shown in a letter by Professor L. J. Savage to the writer. Thus the problem is re- 
duced to 4 items as developed in the present section. 

But additional items might be used to take advantage of the “links” developed in 
Section II, paragraph 6. It was found, e.g., that 

[tk] [28;1} 

(3.10) (7k 31) [7231] 
be SS (a, + ¢1) (a: + bi) ai 

[tk] [42] 

(ak) [7] 


As the expression on the left is known from manifest data, this does indeed add an addi- 
tional equation for the latent marginals of each of the 4 items. But (3.10) contains a; 
on the right side and if we substitute from (3.2) this introduces n; and nix. Therefore, 
we need here again equations of type (3.3) for a final solution. 

This is the reason, then, why our concrete examples (except for \ = 2) come from 
“partially degenerate’ structures. These restrictions are put upon the latent mar- 
ginals, so that the equations essentially boil down to forms similar to the case A = 2. 
But the restrictions are such that the cases do not become “fully degenerate’’—the latent 
parameters, as we shall see, can actually be computed. 
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greater, the higher a respondent is on the z-continuum. But the 
rate of increase is not uniform ; the trace line is either convex or con- 
cave as shown in Figure 2. 

Type 1 trace lines would be represented by quadratic parabolas 
with a minimum at point A; trace lines of type 2 would have a maxi- 
mum at point B. The assumption in this simplified scheme is that 
the ‘‘concave”’ items all have their minimum at the same point A 
on the z-continuum and the ‘“‘convex’’ ones their maximum at B. 
No people would be found on the continuum outside the line AB. 


(2) Bey (3) linear 
eae 


Figure 2 

Such trace lines would represent a variety of situations well known 
in common test practices. Items may be worded positively or nega- 
tively as, e.g., in the following two statements: 

(a) The British Army is fighting gallantly. Agree—Disagree. 

(b) The British Army is not fighting gallantly. Agree—Disagree. 
In order to express a pro-British attitude, which we define as a 
“positive”? response, a respondent has to check “‘agree” after ques- 
tion a but “disagree” after question b. In the context of this test, 
question a is a “positively” worded question and question b is 
“negatively” worded. Suppose that respondents have a tendency 
to check “agree” rather than “disagree.” This could be true es- 
pecially for those who are in the middle range of an attitude con- 
tinuum and who therefore have a harder time in making up their 
minds how to answer. If this is the case, question a might have a 
convex and question b a concave trace line.” 

Another case in point is the handling of ‘‘don’t knows.” Suppose 
that a set of questions were asked of respondents, all of whom had 


It is important not to confuse a “positive” answer with the positive wording of a 
question. In the context of this test, ‘disagree’ on question b is a “positive” answer 
to a “negatively” worded question. 
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made up their minds on each item, and that the trace lines of each 
item were reasonably linear, such as (3) in Figure 2. Now add 
people who say “don’t know” in reply to some of the questions. 
Their position on the z-continuum will again be in the middle range 
between A and B. If we add the “don’t knows’ to the positive 
replies, then the trace line is likely to become convex; if we add 
them to the negative replies, the trace line is likely to become con- 
cave. 

So much for the psychological background of the present model. 
Its equivalent latent structure has the advantage that it can be 
easily derived from the manifest data. It has, of course, three 
latent subclasses because the trace lines are quadratic. In addition 
it turns out that for each item two of the three latent marginals are 
alike. (The proof is not given here.) More specifically, the latent 
structure would look as follows: 


TABLE 7 


Generat Latent Srrucrure ror Resraicrep Latent Tricnoromy 


Latent class Latent marginals for items 

frequencies 1 8 g 
mr a +e Gates ds + bs ay + by 
niu a a; Gs + bs a + hy 
nut a as a, 


To facilitate the understanding of this structure, we shall use the 
following set of conventions according to whether an item is ‘“‘con- 
cave”’ or “convex.” 


“concave” “convex” 
odd index numbers even index numbers 
index letters plain index letters with bar 
larger latent marginal: a + c larger latent marginal: a + } 


Table 7 is restricted to 4 items because the addition of items would 
add nothing to the basic algebra, as will be evident from the final 
formulae. 

From equation (3.1) we can easily compute the manifest cross- 
product matrix which would be generated by the latent structure of 
Table 7.11. (See Table 8.) 

1 This case is, of course, different from the “fully degenerate” structure discussed in 


Section II, paragraph 7. Here the equal latent marginals appear in different classes for 
“even” and for “odd” items. 
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TABLE 8 


Cross-Propucr Marri ror Resraicrep Latent Trichoromy 


1 8 2 4 
1 _— Crest been rnii1 beni 
3 CD = bscanirninr becsmanii1 
2 breyninit becaninii1 _ babyortr 
4 becanrny11 byesninii1 babar — 


The new symbols which appear here are introduced for the sake of 
brevity: vr = ny (nm — my) and or = min (mn — Mart). 

This latent structure has 11 unknown parameters and leads to 16 
equations. Therefore, five conditions have to be imposed upon the 
manifest data as conditions of reducibility. One of the conditions 
is equation (3.4). This condition applies because it holds for all 
cases where m = 4 and A = 3. 

Another condition can be seen from the latent structure scheme : 


12] [14] 


kT) | (32) faa) | 7° 


The existence of this condition permits the introduction of the sym- 
bol: 


(3.11b) Cae Ne see 


where of items j and & one must be concave and the other convex. 
We see, e.g., that 


[12] [13] _ [14] [13] 
[23] [34] 


Two more conditions can be derived from a further study of 
[ik] = ninminbic;. From the findings of Section II, paragraph 6, 
we know how to form [1k,j] or [1k;I]. We simply have to multiply 
any latent class frequency which appears in a formula by the corre- 
sponding latent marginals of the stratifying item. Thus we would 
find, e.g., that: 


[1233] = nz (a3 + ¢s) mii Gabecy 


and 
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[14;3] = mz (as + cs) nin Gsbacr 


This makes it evident that there exists the condition that: 


[12;3] _ [14,3] 
[12] ~ [14] 


A similar condition can be derived for “even numbered”’ stratifiers : 


[jks] _ [lk;2) 
[jk] [1k] 
There then exists for each item, convex or concave, a unique ratio: 
~ (lk; 
3.12 bee a 
(3.12) m= Tie] 


which can be formed from the crossing of any two other items, as 
long as they are convex and concave respectively. This ratio, inci- 
dentally, should not be confused with any of the links, yu; (see Sec- 
tion II, paragraph 6), which, in the present case of 3 latent sub- 
classes, would be a ratio of two corresponding determinants of 
second order, with each element being an [17]. 

There exists still a fifth condition. Because of its more complex 
nature we shall come back to it later. 

The solution now proceeds very much as in the case of A = 2. 
From an inspection of the basic schemes in Table 8 we can multiply 
various cross products to get results of the following kind, e.g., 


[12] [13] — DrcsPegnr*nirt (n — m1) 


[23] becsnynii1 = ens in 7 m) 
These lead to the findings that 
eae 
V Vy (1 = V1) 
S3 
3.13 = 
( ) C3 en. a = = 
S3 
I 
Vou (1 — vin) 


by = Ss 
Voit (1 — vir) 
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ni ni . : 
where v1 = a and yi = re these are the relative frequencies of 


those latent classes where each type of item has a marginal different 
from its marginals in the other two latent subclasses. 

We now draw on the manifest marginals, p;, of the items. Tak- 
ing first a concave item we find from Tables 3 and 7 that 


r= a + 1c) 


Using (3.13) this gives 


(3.14a) aq = ~pi — S; Ni 2 vy 


ata= pt Sy 


Vy 


For convex items the marginals are: 
Ps = a; + (1 — rir)bs 
Using (3.13) again this gives: 


1 — ¥in 
Vit 


a + by = pa + Sri 


However, we do not know in terms of manifest data the values 


(3.14b) a, = pr: — Ss 


ai 
1-y 


ana fe fi = von 
Vir 


To find them we remember that 


(3.15) ty 


m = (a1 + a) 
N2 = (a2 + be)az 


and 
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where the coefficients on the left are formed according to (3.12) from 
manifest data." 
Using (3.148) we find 


— fy? 
™ = pr _— S? + pS; (=) 


Now we form the expression 


S Pi ™ 
l = a 
(3.168) Ky a Bi + mS 


and 1; is the root of the equation 
(3.17a) t? + Kit -—-1=0 


In (3.11b) S; is defined as a positive value. Therefore é; has to 
be the positive root so that the obvious inequality a; + c; > a; is 
maintained. 

In the same way we derive the equation 


(3.17b) tur? + Kintnr — 1 = 0 
where 

= St_ Pt 
(3.16b) Km = - B+ 


It can readily be demonstrated that é: would not be different if 
instead of item 1 we had used the other concave item 3. In the 
same way, the convex items 2 and 4 are interchangeable for the 
formation of Ky; and t1:. The inverse definitions of t; and t:1: in 
equation (3.15) however have to be kept carefully in mind. 

The final solution then is given by equations (3.14a) and (3.14b) 
after the ratios of the latent class frequencies have been computed 
from equations (3.17a) and (3.17b). It will help in understanding 
the result, if one notices that K; is formed from odd indices and 


12 It is now possible to name the fifth condition for reducibility, to wit: 
tt _ [12] [34] _ [23] [14] 
fr = [13] [24] [13] [24] 


The right-hand equality merely reproduces the second condition, equation (3.118). The 
whole relationship can again be directly seen from the cross-product matrix of Table 8. 
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Kin from even indices. But the result is the same, whichever odd 
or whichever even item is used for the formation of Ky and Ki 
respectively. 


4. THE RELATION BETWEEN TWO LATENT DICHOTOMIES 


Suppose that two tests have each been found to be reducible to a 
latent dichotomy. Then by giving the two tests to the same sam- 
ple it should be possible to study the relationship between the latent 
dichotomies. To simplify the problem, we shall assume that each 
test is “pure,” i.e., the interrelationships between its items are fully 
accounted for by its own underlying continuum. But the position 
of a respondent on one continuum might well be related to his posi- 
tion on the other one. This would lead to the following latent struc- 
ture. Items belonging to test A will be designated by odd indices; 
items from test B by even indices. People can be in four latent 
subclasses according to the following self-explanatory scheme: 


Latent dichotomy A 


ns 
dichotomy B 


We shall introduce 3 items for each test and their latent mar- 
ginals will fit into the following scheme: 


TABLE 9 


Latent SrructurE ror Two Revatep Latent DicHOoToMIES 


Latent class Latent marginals for items 

frequencies 1 8 6 2 6 
nt ath a; + by a; + b a2 + be ath Ag + be 
nit a +h a; + bs a,+b, Gs a Gs 

na a a: as a: + by a, + by Ge + be 
niv a a3 as a2 a a 


For each item we have only two latent marginals since, by as- 
sumption, the probability of answering test A should not be re- 
lated to position on the latent dichotomy for B, and inversely. 

However we might well find that the two latent dichotomies 
themselves are related and that, therefore, [AB] = ninyy — nimi 
will be greater or less than zero. 
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This then is a case with m = 6 and A = 4 with restrictions upon 
the latent marginals but without restrictions on the latent class 
frequencies. There are sixteen latent parameters and 64 equa- 
tions, so the conditions imposed upon the manifest data will be 
quite numerous. We shall follow up only those which are relevant 
for a solution in the mathematically perfect case. 

The cross products between items which are both odd can easily 
be computed from formula (2.5a), e.g., 


[13] = nin (a +b — a — bi) (as + bs — as — bs) 
+ nynitr (@1 + b: — ai) (ag + bs — as) 
+ ninrybibs + nuninbibs + nunrvbibs 
+ minty -0 
= ng (n — na)drbs 


In the same way we find the cross products between even items, e.g., 
[24] = np (n — mp)babs 
The cross products between odd and even items is, e.g., 


(12] = nynqz-0-be + nini1bi-0 + ninrvbibe 
— nunibibs + ninryh-0 + nun :0-be 
= dybs (nym — nin) 


= [AB] bibs 
The scheme of the cross-product matrix therefore is as follows: 


TABLE 10 


Cross-Propuct Matrix or Two Revatep Latent DicHoromigs 


1 8 6 2 4 6 
1 = dibwva bibsya bib: [AB] bibs [AB] bibs [AB] 
3 bibsva = babs bibs [AB] babs [AB] babs [AB] 
5 bibywa babsva — bib [AB] bx [AB] bsbs [AB] 
2 = bibs [AB] bab: [AB] bsbs [AB] — bebavs babes 
4 bby [AB] babs [AB] bybs [AB] bibs = babes 
6 bibs [AB] babs [AB] bsbs [AB] _—babave bebe = 


As usual, we designate 


va = Na (n — Na) 
Up = Np (n — Np) 


The “odd” b; values and v, will be found by analyzing the odd 
items according to the formulae for \ = 2 developed in paragraph 2 
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of this section. In a corresponding way, we find the “even” bz 
values and vs. Once they are known, we have any number of odd- 
even cross products from which we can derive relationships like the 
following ones: 
ap) = 02]. 34] _ G4) 
bib, baby 


But from the treatment of the odd and even items separately we 
know that® 


for odd and even items respectively ; the S-values are formed among 
the even and odd items separately. So we finally find that: 


[+k] V valB 
(4B) = "8 Se 


From the knowledge of [AB] and the formerly acquired knowledge 
of n4 and nz we can of course reconstruct the whole latent fourfold 
table. Itis worth mentioning that [ek] 
nS Se 
called phi-coefficient of the latent fourfold table, which appears here 
as a mere algebraic byproduct. 

In paragraphs 5 and 6, Section V, this example will lead to a gen- 
eral discussion of structures with several underlying continua. 


is the Pearson r or the so- 


5. THE LATENT DISTANCE STRUCTURE 


The last structure to be solved is reminiscent of items which form 
a so-called social distance scale. We shall assume m = 3 and 
\ = 4, and introduce a number of restrictions, which all can be 
derived from the following table: 


TABLE 11 


Tae Latent Distance SrrRucrure 


Latent class Latent marginals for tiems 
frequencies 1 2 
ni 1-d 1-d& l1-d 
ni qd, 1-d 1-d 
nur d dy 1-d& 
niv a d, d, 


3 For the meaning of S see paragraph 2 of this section. 


LATENT STRUCTURE ANALYSIS 411 


Here the latent subclasses as well as the manifest items can be 
ordered in a unique way. If we assume the d-values to be less than 
.5, we can put it this way: in subclass I, the majority give a positive 
response to all three items; in subclass II only items 2 and 3 get a 
majority positive response; in subclass III only item 3 does. In 
subclass IV finally, the majority give a negative response to all three 
items. 

The subclasses can be looked on as distances: each item is en- 
dorsed by a majority through a certain number of subclasses, and 
then ‘‘breaks” and falls to minority status. The value 


x e 
qv; = ey 
j tu rh 


indicates how “far” through the latent subclasses the item 7 carries, 
and is an auxiliary latent parameter necessary for the computation 
of this structure. 

An inspection of the scheme shows that there are only seven 
latent parameters; but there are still eight manifest response pat- 
terns, and therefore eight equations linking manifest data to latent 
parameters. One of these turns out to be dependent on the others, 
as can be shown as follows: 

From (2.5a) we can derive the general relationship 


(3.18) [ij] = n? (1 — 2d,) (1 — 2d) (a) (1 — 2) 
From an inspection of Table 11, we can see immediately that 
az; (1 — dj) + (1 — 2,) dj = p; 


and therefore 


may 
(3.19) w%y=7L Bd, 


Substituting equation (3.19) in equation (3.18) we obtain 
(3.20) [ij] = n* (ps — d:) (gq; — 4;) 
where now the order of the indexes matters: 7 < j. 


From equation (3.20) we can derive that 
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(3.2) sp = UATE — (oy — da) (gs — ds) = pate — ch (1 - ch) 


But by applying (2.5) we can also show that 


(3.22) a aa = dy (1 — ds) 


Equations (3.21) and (3.22) together give us 
(3.23) nN = page — S3* 


This is a relationship between manifest constants which three 
items have to satisfy if a solution of the present problem is to be 
possible at all. If it is satisfied, then equation (3.23) permits the 
elimination of one of the eight equations. The one which we shall 
leave out is the one relating the highest joint frequency ny to the 
latent parameters. Thus we are left with the task of solving the 
following seven equations for their seven unknown latent para- 
meters: 


2; (1 — dj) + (1 — 23) dy = 7 (j = 1,2,3) 


aj (1 — dj) (1 — da) + (ae — 23) dy (1 — dh) + (A — 2) dy = “H 
tv =1 


From equation (3.22) we immediately get d, as one root of the 
equation 


d? —-d +7 =0 
From equation (3.20) we see that 


12 [23] 
A= B@—ay 8 YG -d) 

Once the d-values are known, we get the auxiliary z-values from 
equation (3.19) and finally the latent class frequencies from the fact 
that n(21 — x1) = nu, etc. In the next chapter we shall review all 
these solutions in the light of empirical data taken from Research 
Branch studies. 


CHAPTER 11 


THE INTERPRETATION AND 
COMPUTATION OF SOME LATENT 
STRUCTURES’ 


Introduction 


N THE next two sections a number of concrete examples, taken 
from Research Branch studies, are discussed in detail. Their 
purpose is threefold. First, they serve the function of illustration: 
they make more vivid and concrete the necessarily abstract deduc- 
tions of the previous chapter. Secondly, they will afford an op- 
portunity to introduce a number of new aspects of the general topic. 
Once the reader is familiar with the main argument, it is possible 
and profitable to follow up certain sidelines and details which were 
omitted in Sections II and III of Chapter 10. Finally, it will be 
seen that the actual computations involve new problems, because 
we deal in any concrete case with fallible data instead of exact theo- 
retical relationships. 

The computations have been presented in full so that the reader 
may easily check every step and form a judgment as to the character 
of the procedures and their practicality. No effort has been made 
here to present the most efficient computational procedures.? In 
many places we do not hesitate to interrupt the computational se- 
quence in order to show some relationship to the general theory. 

Thus, while the next two sections deal with empirical material, 
the main purpose still is to strengthen the presentation of the gen- 
eral theory. This is even more true for the last section, where a 
number of general test practices are related to the present approach. 

1 By Paul F. Lazarsfeld. The writer wishes to acknowledge the help his assistants 
have given him in the analysis of the examples upon which this chapter is based. Allen 
Barton has written a first report on most of the material included. Peter Rossi and 
Blanchard Lyon have added data and have carefully edited both Chapters 10 and 11. 

2In connection with further work sponsored by the RAND Corporation, intensive 


study is being made of computational short-cuts, and of standardized forms which can 
be used by the ordinary statistical clerk. 


4S 
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SECTION IV 
THE LATENT DICHOTOMY CASE 


1. THE PROBLEM 


So far we have proceeded on the assumption that all conditions 
are satisfied and that the number of latent classes is established as X. 
As yet no attempt has been made to develop the sampling error for 
the theory; on the basis of sampling error alone, we can rarely ex- 
pect these conditions to be met perfectly. One will have to make 
reasonable assumptions as to the number of latent classes with 
which one intends to work. In practice one will not start by trying 
to estimate the rank of a given joint frequency matrix; certain as- 
sumptions will be made as to what latent structure will fit the given 
data, and the work will consist in testing whether the assumptions 
are correct. 

In this section all of our numerical examples will come from ma- 
terial which permits a reduction to two latent subclasses. This is 
a special case of a model where all trace lines are linear. In psycho- 
logical terms, linear trace lines are expected for items in which 
nothing in the wording of the question would cause a sudden in- 
crease of positive replies at any point on the underlying z-con- 
tinuum. The proportion of positive answers increases at a uniform 
rate as the respondents have higher and higher z-values. In the 
hypothetical example of Chapter 10, Figure 1, question 2 would be 
of the kind where a linear trace line might be expected. How to 
decide whether this expectation is borne out by the empirical data 
is the topic of the present section. 

The trend of the argument is as follows: we compute the dichot- 
omous latent structure which comes as near to the given data as 
possible. Then we derive from it a new set of response patterns 
which satisfy all the conditions corresponding to a model with 
linear trace lines. If the computed and the original frequencies are 
reasonably alike, our assumptions are justified. Otherwise, a dif- 
ferent model has to be considered. If the deviations are too large, 
the first approach usually gives good leads as to how to proceed 
next. If the procedure is successful then the solution divides the 
given population into those who do and those who do not have a 
particular latent characteristic. 

By starting with the simplest possible structure, we shall be able 
to illustrate all the necessary steps in the handling of empirical data 
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and in the interpretation of the results. After the reader has famil- 
iarized himself with these procedures, it will be easy for him to fol- 
low the manipulation of somewhat more complex structures to be 
discussed in the next section, since many of the procedures de- 
veloped for the latent dichotomy will also apply in the subsequent 
cases. 

From the solution given in Section III it can be seen that three 
items permit us to find all the parameters of a latent dichotomous 
structure, provided that the three items have positive cross prod- 
ucts all through the system. 

Let us summarize the necessary formulae for this case. We com- 
pute for one of the items, e.g., item 1, the values 


(4.1) ae Jahns and “Giga a 


Then we compute the value 


St Py 
(4.2) es 


where 7: is the proportion of “‘positive’”’ replies on item 1 for the 
whole sample. (The same K-value would have resulted had we 
started with item 2 or 3.) We then take the positive root of the 
equation 


(4.3) @+Kt-1=0 


This root is ¢ = Ne where n; and nz are the two latent class fre- 
II 


quencies, and n; + 1 =n. Thus from ¢ we compute the two 
latent class frequencies. 


We find that 
= K+ jKi+4 
(4.4) (=> r 
and 
Mts oF 
(4.5) 2 oe “ite 


Ni — NH 


It is also helpful to remember that K = aa 
Tfeir 
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Finally we compute the latent marginals from: 


S. 
py = p53 + 7 
(4.6) 
Pus = ps — Sz 
We have dealt with the simple three-item case. Where four 
items are used, the computations are somewhat more involved. For 


the four-item case, the latent structure is given by the following 
scheme: 


TABLE 1 


Tus Latent Structure or a Latent Dicnoromy 


Latent clase Latent marginals for items 

frequencies 1 2 8 4 
nt a, + bi a, + b; a; + b; as + be 
LO $4 a as as % 


This structure has 10 latent parameters, and 16 manifest response 
patterns. Therefore, 6 conditions will have to be satisfied between 
the 16 manifest joint occurrences (they were summarized in Sec- 
tion II, paragraph 7). For the present purpose these conditions 
will be presented in a slightly different way. 

Four items can be combined into 4 “triplets.” Besides the com- 
bination {234}, item 1 can be combined with items 2 and 3, or with 
2 and 4, or with 3 and 4. This gives three values for S,, one for 
each triplet used. If we take the three triplets aN Naa item 1 


we would have 
‘Sie 22) [13] 
1 "Nn? [23] 
» _ fL2] 14) 
(4.7a) 8" = 7? [24] 


gr = (231014) 
1 ON r? [34] 
In the perfect case 


(4.7b) Sy a 8," = Si” 
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[13] _ [14] [12] _ [14] 
because fo3) — [aa] °° [23] [34] 
Section II, paragraph 6. In other words, S; 7s a value attached to 
item j irrespective of which other two items are used for numerical com- 
putations, as long as the whole system is perfectly reducible to two 
homogeneous systems. 

There are also three “links” for each item. They also are alike 


in the perfect case because 


according to proposition (a) in 


(23;1] (24,1) [8431 
wi pay = Say = aa 


Therefore yu; 1s cdeally also a value attached to item j irrespective of what 
other items have been used in the formation of this parameter. 

This necessarily leads to the finding that, under conditions of per- 
fect reducibility, K should have the same value for any triplet of 
items in the system, since, as can be seen from (4.2), in the forma- 
tion of K only S;, 4,;, and p; are involved and the latter three have a 
unique value for item j, according to (4.7b) and (4.7c) above. The 
reader should keep the following distinction in mind: it is an alge- 
braic fact that, for a given triplet of items, e.g., {123}, K computed 
from item 1 is identical with K computed from item 2, and with K 
computed from item 3, regardless of whether the entire system meets 
the conditions of reducibility. However, only under conditions of 
reducibility is the value of K constant for all different possible trip- 
lets in the system, e.g., K based on {123} and K based on {124} are 
identical. 

Thus, with empirical data, the values of S, wu, and K will vary 
from one triplet to the next. Therefore, if a latent dichotomy is to 
be fitted, adjustments have to be made. It is these adjustments 
which form the main task in actual computations. 


2. THE FITTED CROSS-PRODUCT MATRIX 


We now introduce a numerical example in order to show the com- 
putations step by step. We will use four items from a question- 
naire on attitude toward the Army. The items are: 


1. In general, how well do you think the Army is run? 
Positive answer: Very well or Pretty well 64.1% 
2. Do you think when you are discharged you will go back to 
civilian life with a favorable or unfavorable attitude toward 
the Army? 
Positive answer: Very or Fairly favorable 37.4 
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3. In general, do you feel you yourself have gotten a square 
deal from the Army? 


Positive answer: Yes, in most ways I have 30.0 
4. Do you feel that the Army is trying its best to look out for 
the welfare of enlisted men? 
Positive answer: Yes, it is trying its best 25.4 


These four dichotomies give us sixteen pieces of manifest data 
(Table 2) which can be expressed either in terms of positive fre- 
quencies or as response patterns of positive and negative responses. 


TABLE 2 


Manirest Data oF Four ITeMs on ATTITUDE TOWARD THE ARMY 


(a) As positive frequencies (b) As response patterns 

3 4 
n = 1,000 (sample size) + + + + 75 
m = 641 5 Ss a Se 69 
™% %= 374 . + + - + 55 
m = 300 marginals ne os + es 42 
mH = 254 an. a a 0 3 
ti = 295 + + _ = 96 
ta = 246 + —_ 4 — 60 
me = 217\ joint + - - + 45 
ts = 163 / frequencies - + + - 16 
rm = 141 - + - + 8 
tu = 130 > TS veer see 10 
ms = 144 . + = - - 199 
mn = 130 ( ‘ple a a 52 
mu = 17 ( Pe ienci a 25 
mm = 78) frequencies - a BE eo 16 
mau = 75 - - - - 229 


Indeed, we can get the positive frequencies from the response pat- 
terns and vice versa. It might be noted that misy is identical with 
the response pattern (+ + + +). 

We now form a cross-product matrix by computing the basic 
determinants [17]. These are simply the cross product of a four- 
fold table showing the combinations of positive and negative occur- 
rences of two items: 
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Item i 


Item j 


n=ate n=at+b+ct+d 
[77] =ad —be = n-nyj — nm; 


Table 3 shows the cross-product matrix for these four items. The 
cross products have been standardized by dividing each one by the 
square of the sample size. 

TABLE 3 


Emprmica, Cross-Propuct Marrr: [%j)/n? 


1 2 3 4 
1 as 0553 0537 0542 
2 0553 — .0508 0460 
3.0537 .0508 — 0538 
4 0542 .0460 .0538 = 


From the cross products we can then form the S-values according 
to formula (4.72). As we mentioned before, we get three slightly 
different values for every item, according to which triplet we con- 
sider. The three forms of S, from (4.7a) have respectively the 
values: 


» _ . [12] [13) _ [55,266 <_53,700 _ 
Si 7? [23] = ~ (1000)? 50,800. 50,800 ~ .2417 from {123} 


12] (14) _ [55,266 x 54,186 
5 = aie = 4) ¢000;246, Goa 7 -2551 from {124} 


In the perfect case these three values would be alike. With 
fallible data a “true” value for S will have to be estimated and in 
the present procedure the “true” value is taken as the geometric 
mean of the three actual values. The geometric mean is chosen 
because of the great ease with which it can be computed using the 
logarithms of the cross products. We translate the cross-product 
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matrix into its logarithmic form as in Table 4, which is derived from 
Table 3 and a table of logarithms. Note that the cross products in 
Table 4 are not standardized by division by n?. 


TABLE 4 


Logarirums or EmpirrcaL Cross-Propuct Matrix From Tass 2: log [27] 


1 2 8 4 
1 — 4.742458 4.729974 4.783887 
2 4.742458 — 4.705864 4.662796 
3 4.729974 4.705864 — 4.780782 
4 4.733887 4.662796 4.730782 _— 


From Table 4 we can compute directly the logarithm of the 
“true value” of S;. It can be easily verified that S; can be com- 
puted from the following formula # 


log S, = —+— J" tog fii] 


= ERIC DY Moe tit — logn 


where m is the number of items in the matrix; in our example, 
m= 4, The first summation in the formula is taken over all cross 
products involving 7. If j = 1 this would be the sum of all the 
figures in the first row of Table 4. A cross-product matrix can, of 
course, always be so arranged that any item is in the first row. The 
double sum covers all the items to the right of the diagonal which 
are not in the first row. Formula (4.8) can easily be remembered 
by noticing that the two sums on the right are formed over the 
boldface figures and the italicized figures, respectively, as indicated 
in Table 4, as long as item 7 is thought of as determining the first 
row. The numerical results (from 4.8) are, in the present ex- 
ample: 


(4.8) 


log S, = .385533 — 1 
log Ss; = .337932 — 1 
log S; = .365683 — 1 
log Ss = .346106 — 1 


The final values of the adjusted S; are therefore 


* In order to avoid cumbersome symbolism, the summations are explained in the text. 
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S; = .24296 Se = .21774 
S; = .23210 Sq = .22187 


From (4.8) we can now compute adjusted or “‘fitted” cross products. 
In the case of perfect reducibility, the relationship holds that 


(4.98) fa = 8S; 
and 
(4.9b) log [17] = log S; + log S; + 2logn 


In the case of imperfect data we can introduce “fitted” cross prod- 
ucts [1j] by applying (4.9) to the adjusted S-values. Thus, for ex- 


ample, the fitted na = SS, = .24296 x .22187 = .0539, which is 


slightly different from the original value .0542. The actual compu- 
tations are usually made with logarithms. 

The numerical values of the actual and the “‘fitted”’ cross products 
(divided by the square of the sample) are given in Table 5. The 
fitted values, of course, satisfy perfectly the condition that tetrads 
vanish. 


TABLE 6 


ComMPARISON oF ACTUAL AND Frrrep 
Cross-Propuct Marrices Dsrivep From TasLy 2 


; 2 3 4 
Actual .0553 
1 Fitted (.0529) 


Difference +.0024 


.0553 


2 (.0529) 
+.0024 

.0537 

3 (.0564) 


— .0027 


.0542 
4 (.0539) 


+.0003 
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Comparing the actual and the fitted data gives a general idea how 
hopeful it would be to try a latent dichotomy structure. If at this 
point there is wide divergence, we would know that we are on the 
wrong track. If the differences are small, we will proceed with the 
analysis.‘ 

It should be made clear that in the absence of an exact sampling 
theory this judgment can be based only on common sense and has 
no precise justification. Two general considerations, however, give 
some assistance at this point. First of all, the deviations can be of 
two kinds: they may be of approximately equal size, scattered all 
over the matrix, or they may be concentrated within a small sub- 
group of items. If, for example, the cross products among a few 
items have positive deviations in all combinations of these items, 
then this is a strong indication that the conditions of homogeneity 
for a ‘‘pure’’ test are not fulfilled (see Section I, paragraph 2, espe- 
cially equation 1.1). There is likely to exist a “group factor” which 
ties these items together beyond what can be accounted for by one 
underlying continuum. In such a case another latent structure 
has to be postulated and its existence tested, as, for example, the 
latent turnover developed in Chapter 10, Section III. In the pres- 
ent example, no triplet has all of its three deviations in the same 
direction and no group factors seem to be present. 

In this case the judgment can be guided by a second consideration, 
namely that the deviations between actual and fitted cross-product 
matrices have a direct meaning. They are the deviation of the 
actual joint frequency of the two items from the joint frequency - 
which would fit the conditions perfectly, the deviation being ex- 
pressed as a proportion of n, the sample size. Thus the difference 
between the observed [12]/n? and the fitted [12]/n? is .0024. This 
means that 7, is greater than it should be for a perfect fit by .0024 
of the sample size. Since n = 1000, nz is 2.4 cases too large to 
meet the tetrad condition perfectly. Experience shows that this 
is a difference which would not turn out to be very disturbing in the 
eventual fit of the latent structure to the manifest data. 


3. THE NUMERICAL COMPUTATION OF THE LATENT STRUCTURE 


In order to compute the latent marginals and class frequencies 
through formulae (4.5) and (4.6) we must now get the value 


‘The above procedure, while developed independently, was later found to be very 
similar to one developed for the Spearman general factor theory. See L. L. Thurstone 
Multiple-Factor Analysis (University of Chicago Press, 1947), p. 277. 
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Sj _ Pi Bi 
4.10 K= i - 
Gs) Bs Ss = pS; 


It will be useful to carry out the computation of K for one triplet, 
e.g., {123}, in all detail, since this parameter plays a basic role in 
the whole procedure. This will also provide an occasion to review 
once more the formation of the other parameters, namely S and u. 
(In working with more than 3 items, one would not compute all the 
separate K-values; one would insert in 4.10 the adjusted S-value 
computed by 4.8. The separate computations have been included 
here to give the reader complete information on one triplet.) 

Our first job is to compute the p-, S- and yu-values: 


py = Bi .641, p2 = = = .374, Ps = ms = .300 


For the S-values we use the cross products [ij]. Table 3 already 
contains the computed cross products (divided by the square of the 
sample size) ; the computation of these basic determinants is shown 
in paragraph 2. 

Therefore 


ra A ie 1 [55,266 X 53,700 _ o4izo4 


1000 50,800 


_ 1 fiz} [23] 55,266 X 60,800 _ 
a= al ia wna ee 53,700 228051 


_1 /[i3][23]_ 1 /53,700x 50,800 _ 
Si= ON Tig] = 1000 55,266 eralia 


mh] 


Finally, we require the u-values, e.g., 


— [2331 
Mi [23] 


We already have [23]. The quantity [23;1] actually is the value 
of [23] for only those who are positive on item 1: in other words, the 
cross product of the following fourfold table: 
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Positive on Item 1 
Item 2 


Item 3 


Ny = 295 346 m = 641 


(23;1] = nyniss — mista 
= 641 < 144 — 295 x 246 = 19,734 


In the same way 


(13;2] = nenies — nites = 374 X 144 — 295 X 163 = 5771 
(12;3] = ngnizg — rises = 300 K 144 — 246 X 163 = 3102 


We can now easily find the u-values for triplet {123} : 


= [2351] — 19,734 _ 
bh [23] 50,800 = .388465 


_ (1332) _ 5771 _ 


2: 
p= 28) _ 8102. oserass 


We can now compute the K-value for this triplet using any of 
these three sets of values of p, S, and uw. It is generally a good idea 
to compute it all three ways as a check against computing mistakes. 

Using p-, S-, and u-values based on item 1: 


K= .241704 641 388465 


641 ~ 1241704 + “241704 x 641 ~ “22239! 


Using the values based on item 2: 


.228651 374 107467 
= "374 ~ 1228651 + “228651 x .374 ~ ‘732382 
Using the values based on item 3: 
= .222172 .300 .0561285 232386 


300. ~ .222172 + “992172 x .300 ~ 
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The values disagree slightly due to the rounding off of errors; the 
difference is due purely to the mechanics of computation. For 
the first triplet, therefore, we can say that K = .23239. 

In a perfectly reducible case of more than 3 items, K would be the 
same for every triplet. Among other conditions, (4.7c) has to be 
satisfied to give us a unique y,-value for each item. With fallible 
data this will in general not be the case. For item 1, e.g., we obtain 


Thus, for each triplet we will find a slightly different K, even if we 
use the adjusted S-values as developed in (4.8). The adjustment 
at this point cannot be condensed into a simple formula like (4.8). 
Considerable experimentation will be necessary to find out how 
much different adjustment procedures affect the final result. If 
many items have to be considered, a sampling of triplets will be 
necessary to make the whole procedure manageable. Investiga- 
tions in this direction are under way. 

For the present example it is simplest to compute the proper K 
for each triplet by using (4.10). Then, by an application of (4.4) 
and (4.5) to this value of K, we may obtain a value of n; for each 
triplet, as follows: 


Triplet K-value nr 

{123} .23239 442.3 
{124} .45290 389.6 
{134} .30167 425.4 
{234} 23862 440.8 


The average n; is 424.5. It would also have been possible to aver- 
age the K-values and to compute one n; only. This would have 
given the result n: = 424.3, which scarcely differs from the other 
value. 

Rather than compute mn: by computing ¢ using formulae (4.4) and 
(4.5), @ nomograph relating K and ¢ directly may be employed and 
the values of ¢ corresponding to the computed value of K can be read 
off directly. This nomograph is reproduced as Figure 1. 
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Figure 1. Graph of K and t. The nomograph relates the values of K to the corresponding 
values of t. Note that there are two sets of scales. Along the upper horizontal edge are placed 
positive values of K and the line marked ‘‘+K” relates each value of positive K to the values of 
t on the right-hand vertical scale. The value of ¢ for positive values of K is found by using this 
set of scales, 

The other set. of scales is for negative values of K. Along the lower horizontal edge sre placed 
noone values of K and the line marked “—K” relates this scale to the left-hand vertical scale 
of t. 
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Once the latent class frequencies are computed, the latent mar- 
ginals follow easily from formula (4.6). We have already computed 
the adjusted S-values in paragraph 2, because they were needed for 
the fitted cross-product matrix. From (4.4) and using the aver- 


age K-value above, we may find that? = , P = .85847. Thelatent 
TI 


marginals therefore are: 


ee ae 24206 _ 
pu = pt ; .641 + 35847 7 .92402 
pin = pi — Sit = .641 — .24296 X .85847 = .43243 
STi, do Be 21774 _ 
pu = pet ies 374 + "85847 ~ 62764 
Pir = p: — Sot = .874 — .21774 X .85847 = .18708 
ee: 23210 _ 
pis = Pst ai .300 + "85847 .57036 
pus = ps — Sst = .300 — .23210 X .85847 = .10075 
Se 22187 _ 
pu = pat i 254 + 85847 51245 


piu = ps — Sd = .254 — .22187 XK .85847 = .06353 


We now have computed the complete latent structure for our 
four items, assuming that \ = 2. It can be presented in summary 
form: 

TABLE 6 


Compurgep Latenr SrrucrurEe FoR ATTITUDE TOWARD ARMY 


Latent class Latent marginals of items 

frequenctes 1 2 8 4 
my = 424.3 9240 6276 5704 5125 
nu = 575.7 43824 1871 1008 0635 


4. THE COMPUTATION OF THE FITTED RESPONSE PATTERNS 


We can now compute the set of sixteen response patterns which 
would correspond exactly to this latent structure. We have two 
latent subclasses which are both homogeneous. Therefore the num- 
ber of people giving a certain response pattern can be computed by 
multiplying the appropriate pattern of positive and negative latent 
marginals in each subclass. The frequencies are then added up to 
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get the number of respondents for a given response pattern for the 
whole sample. 

The pattern (+ — + —) would be, e.g., given by the following 
number of respondents: 


Mpnguprdru + Nrrprngii2P 1199116 (where g = 1 — p) 


In Table 7 the first column gives the response pattern ; the second 
and third columns report the contribution of each latent class to this 
pattern respectively. The fourth column reports the sum of the 
preceding two: it gives the number of people who would answer in 
this way, if the latent structure were perfectly satisfied. The fifth 
column tells how many people actually answered in each pattern. 

The discrepancies between columns IV and V are a measure of the 
adjustment which was necessary to find a solution for the given 
problem. We compare our actual and a “fitted” distribution of re- 
sponse patterns, just as we would if we had to fit a normal distri- 
bution curve to a given set of data. The fitted response frequencies 
can be computed if we have an S-value for each item and a K-value 


(and therefore a f = =) for the whole system. It is always 
Il 


TABLE 7 


Firrep anp Actua, REsponsE PatTreRN FREQUENCIES FOR 
ATTITUDE TOWARD ARMY 


T II UT IV V vI 
Response Latent Latent Total Actual Rati Col. IT 
patierns class I class IT fitted Col IV 

t++ett+ 71.9 3 72.2 75 996 
+—-++ 42.7 1.3 44.0 42 971 
++-—-+ 54.2 2.6 56.8 55 953 
t++t+- 68.4 4.4 72.8 69 940 
Sob ok 5.9 A 6.3 3 938 
+--+ 32.2 11.5 43.7 45 736 
+-+- 40.6 19.1 59.7 60 680 
--t++ 3.5 17 5.2 10 674 
++-—-- 51.6 39.2 90.8 96 568 
-+-—-—+ 4.5 3.5 8.0 8 561 
-++- 5.6 5.8 11.4 16 404 
+--- 30.6 170.4 201.0 199 152 
--—--+ 2.6 15.2 17.8 16 148 
--+- 3.3 25.1 28.4 25 117 
-+-- 4.2 51.5 55.7 52 076 
---- 2.5 223.7 226.2 229 O11 
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possible to compute such average auxiliary parameters. Whether 
the final fit to which they lead is acceptable can be decided only in 
the light of the problem at hand. We shall come back to this point 
repeatedly. 

Professor Frederick Mosteller has suggested that the goodness of 
fit could be tested by using x? with 2” degrees of freedom reduced 
by the number of latent parameters. Experiments are under way 
to see whether this idea would be borne out in hypothetical models 
based on random numbers. 

It is worth noticing that in the procedure developed two kinds of 
adjustments have been used. The joint frequencies of two and of 
three items were numerically adjusted so as to conform to the condi- 
tions of reducibility. This was done by computing adjusted 
S-values which involved joint frequencies between pairs of items 
only; and by computing an average K, involving triplets of items. 
The joint positive occurrence of all four items, however, did not 
enter the computations at all. They were disregarded in the com- 
putation of the latent parameters. Only at the end when the ulti- 
mate frequencies were computed did they enter the picture. How- 
ever, we see that there is not too great a discrepancy between the 
number of people who actually give a positive reply to all four ques- 


tions (71234) and the adjusted frequency YP Papapapie 


5. THE ORDERING OF RESPONSE PATTERNS AND THE CLASSIFICATION 
OF RESPONDENTS 


We now turn our attention to the final column in Table 7. Each 
response pattern, as it were, recruits its respondents from the two 
latent subclasses. There are, eg., 16 people who answer 
(— + + —) and 60 who answer (+ — + —). Of the former, 
49 per cent come from the positive subclass; of the latter, 68 per 
cent are thus recruited. We can use this ratio as an ordering device 
for the response patterns. Table 7 has been arranged accordingly. 
The latent dichotomy structure thus permits a unique ordering of 
response patterns. 

It is possible to go a step further in the interpretation of this or- 
dering ratio. Jt is the probability that a person giving this response 
pattern comes from class I rather than class II. This can easily be 
seen if one remembers how the two elements of this ratio were de- 
rived. We take, as an example, the people with response pattern 
(— ++ —). In class I the probability of giving this response was 
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QuPwPiuqdu; in class II, it was ginpi2Piusqus- There is no other 
place from which these respondents can be recruited. Therefore 
the reasoning of a posteriori probability applies here. The proba- 
bility that such a respondent comes from the first class is the product 
of the probability of being in class I, times the probability of giving 
this answer if one is in this class. Now the probability of being in 


class I is a therefore, the probability of being in class I and giv- 


The alternative 


ing response pattern (— + + —) is ~/uPuPrdte a3 ze, 


The a posteriori proba- 


probability for class II is “/uPusPusdi, 
n 


bility therefore that a response pattern (— + + —) comes from 
class I is: 


NmIQupPrpPr3914 
AquPuprwqiu + NyIQiInPr2P 199114 


This is exactly how the ratio under discussion was formed. 

Table 7, therefore, not only orders the response patterns; it also 
permits us to classify respondents into the two subclasses. This 
cannot be done with perfect certainty, but only with a specified 
probability attached to the judgment. We see, for example, that 
99.6 per cent of the people who answer (+ + + +) belong in class 
I and 98.9 per cent of the (— — — —) respondents belong in class 
II (because 1 — .011 = .989). These two types of respondents 
will therefore be attributed to latent classes with a very small degree 
of error. But when it comes to the 96 people who answer 
(+ + — —) the misclassification we are forced to make is much 
greater. Of this group, 56.8 per cent come from class I and 43.2 
per cent from class II. It will still be the best judgment to put 
them into class I; but we know that 41 cases of the 96 will be mis- 
classified. 

If one looks at the four items of Table 7 as a test, then a measure of 
precision can be derived from the preceding considerations. We 


may classify each respondent into that latent class where be most 
probably belongs; but we know that we cannot classify all re- 
spondents correctly. Comparing columns II and III of Table 5 
row by row, we shall have to classify the respondents in the top 10 
rows as “latent positive’; but we see that this will be correct for 
375.5 cases only (those in column II). The bottom 6 groups of re- 
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spondents will have to be called “latent negative’; but again we 

shall be correct only for the 491.7 cases in column III. The “preci- 

375.5 + 491.7 
1,000 

It is important not to confuse precision and fit. The latter corre- 
sponds to a comparison of columns IV and V in Table 7. Dis- 
crepancies are due partly to sampling error and partly to the fact 
that the assumed latent structure is not quite the one from which 
the actual data were generated. A certain amount of imprecision, 
however, will exist even if the data match the latent structure per- 
fectly. The attribution of a respondent to a latent class is a proba- 
bility judgment based on his manifest response pattern. It can 
therefore never be made with complete certainty. 

Furthermore this attribution might change as the amount of 
manifest information increases. Assume for a moment that only 
the first three items of our example had been available and con- 
centrate on the people answering (+ + —) and (+ — —). From 
Table 7 it can be seen that 151 would have been in the first group 
and 244 in the second. Their ordermg ratio would have been .717 
and .257 respectively. Now we add the fourth item. This splits 
each group in the following way: 


sion’’ of the test is therefore only = 86.7 per cent. 


Ordering ratio Ordering ratio Number 
Sor first 3 items for all four items of cases 
++-—+ .953 55 
717 
++-- .568 96 
+--+ 736 45 
257 
+o = .152 199 


By increasing our information from three to four items we may 
learn that the second group, (+ + — —), belongs after the third 
group, (+ — — +), in the ordered scheme of response patterns. 
On the basis of three items only, we would have ordered these re- 
spondents in the reverse order. The reader is also invited to note 
that the finer scheme has a higher precision—fewer respondents get 
misclassified. The effect of adding items which satisfy the same con- 
ditions of reducibility consists of improving the precision of a test. 
This is due to the fact that attribution to latent classes is a proba- 
bility procedure. We shall come back to this point at the end of 
this chapter. 
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From the examples in this section we have developed the follow- 
ing basic procedures in latent structure computations: 

(a) The fitting of cross-product matrices for the purpose of eval- 
uating the plausibility of a latent structure under consideration. 

(b) The computation of the latent parameters—the latent struc- 
ture scheme. 

(c) The reproduction of ultimate frequencies as they derive from 
a given latent structure—the fitted response patterns. 

(d) Exposition of the difference between fit and precision. 

(e) The ordering of response patterns according to the probabili- 
ties that persons in each response pattern belong to the first latent 
class. 

(f) Illustration of the role of adding to a system items which 
satisfy the origmal conditions of reducibility. 

We shall see in the next section that very much the same notions 
apply also to other structures which are more complicated than the 
latent dichotomy case. 


SECTION V 


SOME SPECIAL LATENT STRUCTURES WITH MORE THAN TWO 
LATENT SUBCLASSES 


1. A PARTIALLY DEGENERATE LATENT TRICHOTOMY 


In Chapter 10, Section III, the solution of a “restricted quadratic 
case’ was developed. One of the circumstances under which the 
case will often appear will be in surveys where a fairly large propor- 
tion of the respondents give responses of the “‘don’t know,” “‘unde- 
cided,” or similar variety. Persons giving this type of response 
are most likely recruited from the middle range of the underlying 
attitude continuum; shifting about the “don’t know’s” or “unde- 
cided’s” from the positive to the negative side in dichotomizing the 
responses to the questions will have the effect of changing the shape 
of the trace lines from convex to concave. Items which include the 
“don’t know’’ response categories along with the positive responses 
to that question will have conver trace lines. The same items with 
the “don’t knows” added to the negative response will have concave 
trace lines (see Section III, paragraph 3). 

The example we shall use is four questions taken from a Research 
Branch questionnaire on job satisfaction. In the first two ques- 
tions, the response category ‘‘undecided”’ has been classified with 
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the negative response; in the last two questions, the equivalent re- 
sponse categories have been classified with the positive reply. 


1. Would you change to some other Army job if given a chance? 


Positive response: No (23.8%) 
Negative response: Yes and Undecided (76.2%) 


3. Do you feel that everything possible has been done to place you in the 
Army job where you best fit? 
Positive response: Yes (32.5%) 
Negative response: No and Undecided (67.5%) 


2. Do you usually feel that what you are doing in the Army is worthwhile 


or not? 
Positive response: Worthwhile and Undecided (72.9%) 
Negative response: Not worthwhile (27.1%) 


4. How interested are you in your Army job? 
Positive response: Very much interested and A little but not much 


(83.8%) 
Negative response: Not interested at all (16.7%) 


In line with the convention established previously, we have num- 
bered the “‘concave” items with odd numbers and the “convex” 
items with even numbers.® 

The empirical cross-product matrix of these four items is shown 
below in Table 8, with cross products standardized by division by n?. 

TABLE 8 


EmpriricaL Cross-Propuct Matrix or Jos SATISFACTION ITEMS 


1 3 2 4 
1 = 0856 .0375 .0267 
3 .0856 — 0411 .0423 
2 0375 0411 — .0667 
4 .0267 .0423 .0667 — 


The distinguishing characteristic of four-item cross-product 
matrices in the restricted quadratic case is the existence of one 
tetrad with zero or near-zero value and of another tetrad con- 
siderably greater or smaller than zero. The zero tetrad is always 
of the form: 


5 The notational conventions which we shall adhere to in discussing the restricted 


quadratic case are as follows: 
Concave items Convex items 
Item indices odd even 


Subscript indices t j 
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(5.1) [19] [te] — [6k] [30] 
while the non-zero tetrad is always of the form: 
(5.2) [20] (ak) — [9] [ek] 
In the present case, the tetrad eased - Bae) = .00049 


which, while not quite zero, is quite small compared to the non-zero 
tetrad [13] [24] - 12] Se = .0041 which is more than eight 
non n> on 
times as large. 
Since items 1 and 3 are presumed to be “concave” items and 
items 2 and 4 to be ‘‘convex’’ items, the latent structure which 
generated the matrix has the form of Table 9: 


TABLE 9 
Latsnt SrrucrurEe or Reatricrep Latmant TRICHOTOMY 


Latent class Latent marginals of items 

frequencies 1 8 2 
nm a+ a+ cs G2 + bs ag t+ hy 
nit a as Ga + bs ae + be 
NUIT a as as a& 


Four triplets of items may be formed from the matrix of four 
items. Two of the triplets will consist of two odd items and an 
even item, {132} and {134}, and from these t; may be computed. 
The other two triplets, {241} and {243}, will consist of two even 
items and an odd one. From the latter pair of triplets, tir: can be 
computed. Incidentally, this rule obtains for matrices of any size 
in the restricted quadratic case. 


From the triplet {132} we can compute Ki as: 


— Sim Pig he _ See Ps Mae 
68) Kr pr Sizs  pS1.23 Ps Sais a3 pvS3.12 


Note that it makes no difference whether the S- and 7-values of 
item 1 or of item 3 are employed. The K is a characteristic of the 
triplet employed and does not depend (just as in the case of the 
latent dichotomy) on which concave item’s S- and 7-values are 
used. 
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Using item 1, we compute: 


__ {31012 _ _ [65,650) 7,408) _ 
Siss = {a2 [2377 Vi,000,000 (41,075) 7 2796 


238 
Pi = i999 = -2380 
— (23;1] _ _831_ _ 
1.28 has [23] 7; 41,075 .0202 
.2796 ~—-.2380 .0202 


Ky = .6276 


™ 12380 .2796 ' (.2380) (.2796) 


The triplet {134} yields another value for K:. In the case of 
perfect reducibility, K; computed from {132} will be identical with 
K; computed from {134}. In the more usual case of imperfect 
data, the two values will differ somewhat. 

Based on the triplet {134} we find K:: 


_ _ {031014} _ RED) (26,746) _ 
Siu = 4/73 [34] 1,000,000 (42,275) ~ 2°28 


238 
Pi = 3000 = .2380 


_ (3431) _ 920 
M34 [34] = 42,275 .0220 
.2328  .2380 .0220 


Ki = ‘9380 ~ 2328 (.2380) (.2328) 


= .3523 


We take the arithmetic mean of the two estimates of K:: 


K; = BB+ 82 999 
We can now compute t; = 4. a by solving the equation 
—™ eT 
(5.4) i? + Kitty -1=0 


ges Aa 
= 5) + a + 1 = .7846 


The latent class frequency ni can now be obtained from the rela- 
tionship: 
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- 7 
(5.5) m = 74 7a = 381.0 
Ki and the corresponding latent class frequency mi1 are com- 


puted in the same way from the triplets {241} and {243}. From 
the triplet {241}: 


Ki = S26 — Bey ee 2 Se Pe Me 


(5.6) - P2 a Sau pSe.4 Ps Sa.i2 pSar 


Computing Kir using item 2: 


_ fea {i2] _  [6,743) 7,498) _ 
Shu = J nm? [14] = V'1,000,000 (26,746) = 3059 


729 
D2 = 1000 = .7290 
a (14;2) _ 7231 _ 
Maus = [14] = 26,746 = .2704 
.3059  .7290 .2704 _— 7512 


Kin = "7990 ~ [3059 (.7290) (.3059) 
From the triplet {243}: 


= Som Ps, tam _ Sars Pe ats 
Kn Se.34 - pSa.3 ps S423 " pSa.ss 


Bang = [23] _ RKCRZS) (41,075) _ oss 


n? [34] 1,000,000 (42,275) * 
Dp: = oe = .7290 
ua = aa] = 12375 = .3099 
.2047 — .7290 3099 ~ 8438 


Ku = "7290 — (2547 * (7290) (.2547) ~ 
Taking the arithmetic average of these two estimates of Ki: 


Kin = BP oe = —.7975 


Solving for ti = on in the formula 


Nr 
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(5.7) tu? + Kintinn — 1 = 0 
a 2 
tur = oA + ve +1 = 1.4753 
We get 
n 
(5.8) mun = 74g = 3148 


The remaining latent class frequency nr: may be found directly, 
once 7; and 111 are known, since 


nen nny = 1,000 — 381.0 — 314.8 = 304.2 


Just as we adjust the values of K; and Ki by pooling the two 
estimates of each K obtained in each pair of triplets, so also we 
adjust the S-values in computing the item latent margiuals. For 
each item there are two S-values, one from each triplet relevant to 
that item. For item 1, e.g., we get Si23 based on the triplet {123} 
and Sa based on the triplet {134}. For lack of any rigid sam- 
pling theory, we take the geometric mean of the various S-values 
and compute an adjusted S’. 


4 
» 1 “[i3} [23] (34) _ 


n 

1 

n 

1 */[24]? [12] [23] 

fo = = 

Sy! = = (147 [34] 2791 
1 
n 


The a-values for each item are given from the relationships 


(5.10) a: = pi — S,'ty 


and the c- and b-values from the formulae 


c; = Si BE where yy = mt 
V vy 6! =< V1) n 
(5.11) 


th 


4 
3 NI 


b = —S——————_ where yyy = — 
Vn (Ll — vit) n 
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The a-values in the present case are easily computed using the 
formulae (5.10): 


a, = pi — Sy'ty = .2380 — (.2551) (.7846) = .0378 
=p;— Sti = .3250 — (.3357) (.7846) = .0616 
Gd, = p2 — Sy'ti1r = .7290 — (.2791) (1.4753) = .3172 


a = py — Si'tirr = £8330 — (.2391) (1.4753) = .4803 


Finally, the b- and c-values, from (5.11): 


7 SY SD5B1 = ee 
Vii (1 — 1) V(.3810) (.6190) 
= Si 3357 = g913 
Vor (1 — 1) ~—-V(-3810) (.6190) 
Si’ a 2791 go 
V vir (1 — vii) VO 3148) (.6852) 


Sd 2391 


us = V Pint (1 = Vir) a Vv (.3148) (.6852) sects? 


The latent structure for the present case is found by combining 
the a-, b-, and c-values according to the hypothetical latent struc- 
ture and is shown below in Table 10. 


TABLE 10 


Comrutep Largent Srrucrurp FROM Jos SaTIsFAcrion ITEMS 


Latent class Latent marginals for items 
Srequencies 1 8 2 4 
my = 381.0 563 753 918 995 
mi =-304.2 .038 062 .918 995 
ni = 314.8 .038 062 317 480 


2. CLASSIFYING RESPONDENTS ACCORDING TO A LATENT TRICHOTOMY 


In order to use the latent structure for ordering response patterns, 
we have now to compute from Table 10 the ultimate frequencies cor- 
responding to the mathematically perfect case. In doing so we 
will at the same time learn how the people who make these responses 
recruit themselves from the latent classes. Since we have three 
latent subclasses, persons giving each response pattern will be di- 
vided into three homogeneous classes and the number of persons 
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giving a certain response pattern can be computed by multiplying 
the appropriate pattern of positive latent marginals and comple- 
ments of positive latent marginals in each subclass. 

The number of persons giving the response pattern (— — + +) 
recruited from latent class I is: 


nr (1 — pn) (1 — pis) preps = (381.0) (.437) (.247) (.918) (.995) = 37.59 


The number of persons giving this response pattern recruited from 
latent class II is: 


mu (1 — pm) (1 — pia) prep = 
(304.2) (.962) (.938) (.918) (.995) = 251.0 


Finally, the number of persons recruited from latent class III is 
given by: 


miu (1 — pun) (1 — pins) punpun = 
(314.8) (.962) (.938) (.317) (.480) = 43.30 


By adding these three components we get 331.9 as a fitted frequency. 
The other figures in column VI of Table 11 were obtained by similar 
procedures. The reader will notice that a simple rule leads from 
Table 10 to Table 11. We first multiply n: by the four probabilities 
or their complements in the first row of Table 10. The same oper- 
ation is repeated in the second and third row using ny and mir. 
Then an addition over the three rows is performed. Whether for 
an item py; or (1 — p1;) is used depends on whether, in the response 
pattern in question, the answer to this item is positive or negative. 

In columns II, III, and IV of Table 11 we have not entered the 
absolute numbers obtained. Rather, the number in each class 
is expressed as a per cent of all the respondents giving a certain 
response pattern (column VI). Row by row, columns II, ITI, and 
IV add to 100 per cent in Table 11. These are the a posteriori prob- 
abilities discussed in paragraph 5 of the preceding section. They 
indicate the probability that a respondent giving a particular response 
pattern comes from each one of the three latent subclasses. 

As in the dichotomous case, we can classify each response pattern 
according to its modal probability. Reading from top to bottom in 
Table 11, the people who give the first seven response patterns will 
most likely come from latent class I, the next from class II, and the 
last eight from class III. Within these groups, we can perform a 
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further ordering, but it will remain somewhat arbitrary. The prob- 
lem is exemplified by a comparison of patterns (— — — +) and 
(-— ++ —-—). According to column IV, the former has a lower 
probability for class III than the latter. However (— + + —) 
might be called “‘bimodal.”’ It has a sizable recruitment from class 
I, which (— — — +) does not have. To order these two patterns 
requires some convention such as assigning numerical values to the 


TABLE 11 


ComPLETE ANALYsIS OF RESTRICTED LATENT TRICHOTOMY ON 
Jos SATISFACTION 


I WI II Iv Vv VI VII 


Response Per cent in latent class Fitted Actual 

pattern I II Ill total total 
++++ 99.5% 0.4% 0.1% 100% 148.4 147 
++-4+ 97.8 0.4 1.8 100% 13.4 11 
-~-+44+ 85.6 12.3 2.1 100% 133.9 128 
+++ - 85.2 0.4 14.4 100% 0.8 1 
+-4++ 80.7 16.4 2. 100% 60.0 58 
-+-—-+ 57.3 8.3 34.4 100% 17.8 27 
+--+ 48.7 9.9 41.4 100% 8.9 9 
--++ 11.3 75.6 13.1 100% 331.9 341 
SS 2.8 18.8 78.4 100% 118.9 112 
++-- 19.6 0.1 80.3 100% 0.3 4 
-4+4+- 14.9 2.1 83.0 100% 3.7 2 
+-+- 11.0 2.2 86.8 100% 2.1 5 
--+- 0.4 2.5 97.1 100% 48.2 47 
-+-- 0.7 0.1 99.2 100% 6.7 5 
a 0.5 0.1 99.4 100% 4.0 3 
--—-- 0.0 0.1 99.9 100% 101.0 100 
Totals in 


each class 381.0 304.2 314.8 1,000.0 1,000 


latent classes and then using the average of the latent marginals for 
each response pattern as an index. We shall come back to this 
point twice, in paragraphs 2 and 3 in the last section. 

Here, as in the dichotomous case, we can develop a measure of 
precision: the proportion of people who are properly classified by 
this structure. The original data, e.g., shows that of the 133.9 
which in the perfect case give response pattern (— + + +), 114.5 
come from class I, 16.5 from class II, and 2.9 from class III. But 
we cannot tell which individuals these are. All we know is their 
identical response pattern (— + + +). The best solution is to 


SOME LATENT STRUCTURES 441 


put all 133.9 in class I. But only 114.5 belong there. Repeating 
this reasoning for each line, we obtain a precision of 84.7 per cent. 
This means that 15.3 per cent are inevitably misclassified. 

The reader can check for himself that the precision would be con- 
siderably less if only three items had been used. Inversely, the 
adding of further items, satisfying the conditions of reducibility, 
would increase the precision of the test. Experiments are under 
way to study how this precision increases. It also would be desir- 
able to study further the mathematical relationship between the 
number of items and degree of precision. The formulae are easily 
derived but lead to rather involved combinatorial forms. 

Finally, the reader is asked to compare columns VI and VII of 
Table 11. The ‘fit’ of the structure to the empirical data can be 
gauged from these two columns of figures. In the present example, 
the fit is rather close. There is nothing in the interpretation of 
closeness of a trichotomous fit which goes beyond that discussed in 
the dichotomous case. 


3. THE LATENT DISTANCE CASE 


The more latent classes there are, the more will it be necessary to 
develop general ideas as to how a latent structure can be used to order 
manifest response patterns. The next example will therefore deal 
with the “latent distance” case, which of all the cases dealt with at 
the end of the previous chapter has the largest number of latent 
subclasses. For a discussion we turn to a concrete example. The 
following four items were taken from a neurotic inventory: 


1. Have you ever been bothered by pressure or pains in the 
head? 
Positive answer: Yes, often or Yes, sometimesor Noanswer 13.8% 


2. Have you ever been bothered by shortness of breath when 
you were not exercising or working hard? 
Positive answer: Yes, often or Yes, sometimes or No answer 30.7 


3. Do your hands ever tremble enough to bother you? 
Positive answer: Yes, often or Yes, sometimes or No answer 43.1 


4. Do you often have trouble in getting to sleep or staying 
asleep? ; 
Positive answer: Very often or No answer 57.1 


The assumption was made that a latent distance structure would 
fit the material. Therefore the nine parameters of Table 12 had 
to be computed: 
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TABLE 12 


Larsnt Srructors or Latent Distance Scary 


Latent clase Latent marginals of items 

frequencies 1 2 $8 4 
ni l1-d l-d l—d l—-d& 
ni a 1-d 1-—4d l1—d& 
Mr11 d, d; 1-— ds; 1—-d& 
niv dy a; d; l-d& 
fy d, d, ds d 


It is immediately obvious that here the items can be arranged in 
terms of their manifest marginals. For the value n;/n = 2,(1 — di) 
+ (1 — 2,)d,; obviously increases in the order in which the items 
are arranged in Table 12. (The reader will remember that 2; indi- 
cates the “breaking point’’ of each item—see Section III, para- 
graph 5.) 

In order to compute the latent parameters, we can discard the 
equations for ms and for the joint frequencies of three items. This 
leaves eleven equations for nine parameters and therefore two fur- 
ther conditions have to be satisfied. Here the propositions of Sec- 
tion II, paragraph 6, Chapter 10, are of no help because \ > m 
(5 > 4). Special computations have to be made to find these two 
conditions. One is again a tetrad condition: 


[12,34] = [13] [24] — [14] [23] = 0 


which means that the tetrads on one side of the principal diagonal 
vanish (but not those which cross the principal diagonal). From 
this type of condition we again derive adjusted S-values as geo- 
metric means of the values 

ta| (2b 
(5.12) S6 ad 

But here a and b, the two auxiliary items, have to satisfy the con- 
dition that a > 7 > b (where items are ordered by their manifest 
marginals). 

For the present structure, therefore, S-values are defined only for 
the “‘medial” items (in our example the medial items are item 2 and 
item 3) excluding the first and the last. Once adjusted S-values 
have been computed, it is easy to compute all the d-values of the 
medial items from the equations 
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(5.13) (ps — di) (qi — ds) = S? 


In our present example we find 


[12] = 53,684 [13] = 45,522 [14] = 31,202 
[23] = 71,683 [24] = 68,703 [34] = 93,899 
Therefore 


_ 1 /fi2) [23] (12) [24] _ 
Si? = 1 ria} ay = 099870 


1 {13} [34] [237 (34) _ 
Sf =; J ria [24] 115851 


dz = .129705 ds = .152708 


And finally 


as the smaller roots of the two quadratic equations 


dy? — dz + (psgz — S:*) = 0 
dj’ — ds + (pags — S;*) = 0 


which are solved, according to the quadratic rule, 


ints 1 — (ans — Se) = } — [4 — (07 x .098 — .009870) 


_1_ ft 
dy = 5 — af — (Page — Se) -1_ ft. (431 X .569 — .115851) 


This leaves d, and d,, (in our case d,) to be computed, and here 
the second condition mentioned above comes in. This second con- 
dition is complicated to put in manifest terms, but it derives from 
the fact that 


_¢g-—H2)_ ___[18 
Pi 1 i? (G2 — ds) 1° (Gs — ds) 


or the equivalent form 


aed = —L. . —BA_ 
‘2? (pr — ds) 1? (ps — ds) 
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Again, if we have to work with imperfect data we will put 


den -k,{- 12103) 

r= Pt F2VG — ds) Gs — as) 
GterL le {__{24] [34] _ 
(=u 3 (pe — de) (ps — ds) 


In our example we now know from the previous step that 


q2 — d, = .563295, Pz — ds = .177295 
qs — ds = .416292, ps — ds = .278292 


Therefore 


ag 1 [58,684 X 45,522 
d, = .138 — 79998 7563205 x 416292 ~ °0359616 


§ 3 tay sceel 68,703 X 93,899 
oe 1000? VV .177295 xX .278292 


= 0674072 


Once all the d-values have been found we get immediately the 
latent class frequencies from formula (3.19) in Section III. 


1 — a _ .138 — .0359616 


7 = T_ 9d, ~ 1 — 200350616) ~ °209946 
_ pr— dy 307 — 129705 _ 

= od, ~ 1 — 2(.129705) ~ 299997 

ry = PUT ds _ 431 — 152708 _ goog g0 


“1— 2d, 1 — 2(.152708) 
pi — dy _ 571 — .0674072 


nm =ny = 109.95 

Nr = Ne — nr = 129.45 
Min = Nz — Nz = 161.26 
Ny = NI, — nx; = 181.40 
ty =n — nay = 417.94 


We are now in a position to judge the fit of this structure and at 
the same time to see how each response pattern recruits itself from 
the different latent subclasses. In Table 13, the ultimate fitted 
frequencies are computed in the usual fashion from the latent mar- 
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ginals and latent class frequencies which have now been derived 
for Table 12: i.e., nies (+ + + +) recruits nyy d; dy ds (1 — d) 
cases from subclass IV, and ny d; d; ds d, cases from subclass V. Of 
course, the computation of the fitted frequencies for a response 
pattern with a negative response for an item requires the use of the 
complement of the latent marginal derived for that item in Table 
12. Table 13 presents the results in percentages for convenience. 


TABLE 13 


Comp.ere ANaLysis oF Latent Distance Scare on Neurotic INVENTORY 


oe sha Per cent of each pattern in latent class: Fitted Actual 
1234 nn nur nu niv ny total total 
++++ 949% 4.2% 0.8% 0.1% 0.0% 100% 76.8 75 
++—-—+ 90.0 3.9 0.7 4.6 0.8 100% 14.6 10 
+++-— 004 4.0 0.7 0.1 4.8 100% 5.8 8 


+-++ 668 2.9 24.5 5.0 0.8 100% 16.3 14 
-+++ 2.5 79.3 14.7 3.0 0.5 100% 108.3 110 


-~+—+ 1.3 38.9 7.2 45.1 7.5 100% 39.7 49 
+—-—+ 245 11 9.0 56.1 9.3 100% 8.0 ll 
---+ 0.0 1.4 11.9 74.3 12.4 100% 161.9 161 
++-- 36.7 1.6 0.3 1.9 59.5 100% 2.6 3 
-++- 13 40.7 7.6 1.5 48.9 100% 15.2 11 
+-+- 25.9 1.2 9.5 1.9 61.5 100% 3.0 8 
--+- 0.1 1.5 12.8 2.6 83.0 100% 60.2 64 
= SS 0.1 2.5 0.5 2.9 94.0 100% 44.0 41 
+--- 1.3 0.0 0.5 3.0 95.2 100% 10.9 9 
==. 0.0 0.1 0.5 3.0 96.4 100% 287.3 285 
Total in 

each class 109.9 129.5 161.3 181.4 417.9 1,000.0 1,000 


There are certain response patterns like (+ + + +) where 
most cases come from one latent subclass. Others like (— — — +) 
show a latent distribution with much greater dispersion but still 
with an obvious peak in one latent subclass. But some patterns 
show a bimodal latent distribution; responses of this kind seem to 
be ambiguous. They come from people who can have quite differ- 
ent positions on the latent “‘scale’’ which underlies the manifest 
data, e.g., the 49 cases who answer (— + — +). 

It is of course possible to rank response patterns by some kind of 
average latent position: the mode or the median would not require 
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further assumptions; the arithmetic mean would require that we 
arbitrarily assign numerical values, say 1 to 5, to the latent sub- 
classes. But one can also state that certain response patterns 
with a bimodal latent distribution should not be included in the 
scale. People who score this way cannot be properly ranked. 
Finally, it is possible to say that a variety of indices could be used 
to characterize each response pattern. In addition to the average, 
we would use and interpret, e.g., the standard deviation of the latent 
distribution. This last procedure would correspond to suggestions 
recently made by Clyde Coombs.® 


4. THE NOTION OF LATENT LINEARITY 


In the last two examples we were concerned with the problem of 
how a set of latent classes, arranged in a certain order, can be used 
to give order to the manifest response patterns of a dichotomous 
system. In this way, the first connection between latent structure 
analysis and the problem of scaling was established ; for the purpose 
of any scaling model is to provide a device by which manifest re- 
sponse patterns can be ordered in some rational way. The discus- 
sion, however, was predicated on the assumption that the latent 
classes were themselves ordered. But nothing in the algebra of 
latent structure analysis provides for an ordering of the latent 
classes. Therefore, an additional criterion of latent linearity has 
to be introduced. The following definition will be used: 

Assume a set of latent classes, m1, ni, + - - -; assume that for a 
given item j, the classes can be so arranged that, up to a certain 
class z;, the difference of two previous latent marginals, po + v; — 
Pw; 20; for all subsequent classes, pu); — piw—v; SO (where 
a <2; <b); call the class z; the “maximum class” for item j; if 
then each item has a maximum class for the same order of the latent 
classes, we will call this a linear order of latent classes. The struc- 
ture will be said to have latent linearity. A schematic example of 
such an arrangement is presented in Table 14. 

The latent structure of the restricted trichotomy, investigated in 
paragraph 2 of this section, and the latent distance case, discussed 
in paragraph 3, are examples of structures with latent linearity. 

Before giving an example of nonlinearity the notion of latent 
linearity should be related to the concept of trace lines. In Section 
I no limitation was put on the kind of trace lines which could exist. 


® Clyde H. Coombs, “Some Hypotheses For the Analyses of Qualitative Variables,” 
Psychological Review, Vol. 55, No. 3 (May 1948), pp. 167-174. 
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They could have several maxima, oscillate back and forth, or do 
anything else which could be expressed by polynomials of a high 
degree. It makes psychological sense, however, to single out such 
trace lines which have at most one maximum over the populated 
segment of the z-continuum. We can call these trace lines unimodal; 
they will exclude U-shaped curves and will be either monotonic 
ascending, monotonic descending or bell-shaped. Then the fol- 
lowing characterization holds true: the equivalent latent class struc- 
tures of unimodal trace lines have latent linearity. 

A large variety of structures fall into this category. Consider, 
e.g., the possible combinations of the latent distance case and the 


TABLE 14 


Scuematic ExaMpLp or Latent LINBARITY 


Latent class Latent marginals for items 

frequencies 1 2 8 4 
nmr = 100 90 .50 .40 .20 
mu = 300 .70 80 .60 .30 
mi = 400 -40 -50 90 .40 
nry = 200 .20 30 40 80 


special latent trichotomy which were just investigated. By look- 
ing at the structures of paragraphs 2 and 3 in this section, we see 
that the latent trichotomy could really be looked upon as a two- 
step latent distance case, with two questions for each step. This 
suggests that one of the structures to be investigated in the future 
should be a general latent distance scale, with several items on each 
step, which would then form a series of ordered latent dichotomies. 
On the other hand, a general scheme like that in Table 14 could be 
approximated by an adaptation of the scheme in paragraph 3. 
This would give a manageable way to analyze the Thurstone atti- 
tude scale. 

Now we shall give a concrete example of a case where latent 
linearity does not prevail. 


5. THE LATENT TURNOVER 


In paragraph 4 of Section III the case of two latent dichotomies 
was discussed. The mathematics developed there applies quite 
easily, for instance, to repeated interviews. The questions asked 
in the first interview may form one latent dichotomy ; the questions 
asked in the second interview a second latent dichotomy. The 
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replies to the individual questions might show a considerable turn- 
over from the first to the second interview. But the latent attri- 
bute which is common to all the questions might show a great con- 
sistency from one interview to the next. 

We give here an example from a repeated interview study done 


by the Research Branch on attitudes toward officers. Three ques- 
tions were asked :’ 
INTERVIEW 
Ist and 
1. How many of your officers are fair-minded enough 
to treat anybody in the outfit alike? 
Positive answer: All or Most 79% 59% 
3. How many of the officers you now serve under are 
the kind you would want to serve under in com- 
bat? 
Positive answer: All or Most 65 36 
5. How many of your company officers do you think 
would be willing to go through anything they 
ask their men to go through? 
Positive answer: All or Most 65 49 


(The results of these items on the second interview will be re- 
ferred to as items 2, 4, and 6.) 
The first interview gave the following manifest data: 


n = 324 ms = 197 
™m = 257 ms = 194 
ng = 211 ny = 172 
ns = 209 Ms = 165 


Applying the method of solution to two latent classes as given pre- 


viously, we find: 


pi = .793210 
Si = .268155 
pa = .360048 


and by the formula 


ps = .651235 ps = .645062 
S; = .341067 Ss = .824797 
Ms = .101827 Bs = .116342 


— Si Pi Bi 
K Di S.* Sar 


7 Note that these questions are numbered with odd indices on the first interview and 
even indices on the second interview. This convention will be employed throughout 


this discussion. 
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we obtain 
K = —.927238 
and 


K KR? 
i= -a t+ |Z +1 = 1.56586 


From ¢ we can compute the entire latent structure: 


Latent class frequencies Latent marginals 
(1) (3) (6) 
Na 230.14 .96446 86905 85249 
n— Ma 93.86 87332 11717 .13648 


The second interview gives for manifest data: 


n = 324 = 104 
™m = 191 ie = 133 
m = 118 nme = 105 
me = 160 nus = 97 
The auxiliary parameters are: 
ps = .589506 ps = .364198 ps = 493827 
S. = .296616 Sa = .358344 Ss = .402471 
pa = .310106 be = .0419726 be = .139362 
K = .289205 
t = .86580 
Latent class frequencies Latent marginals 
(2) (4) (6) 
ne 138.82 .93210 .77809 .95868 
n— ne 185.18 .33270 .05394 .14537 


We need now the four latent class frequencies to complete the 
latent structure. These form a fourfold table: 


Latent atiribute A 
(attitude on 1st interview) 
+ —_ 
Latent atiribute B + nN nr ny + ni = Ne = 138.8 
(attitude on 2nd 
interview) = niu nv ni + ny = 185.2 


mtn nutty n = 324 
= 7.4 = 93.9 
= 230.1 
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The marginals of this fourfold table are the latent class frequen- 
cies computed for each interview separately, as was done above. 
To fill in the values m;, 11, nix and my, the numbers who have all 
the four possible combinations of positions on the two interviews 
taken together, we need only compute the cross product [AB] of 
this latent fourfold table. 

To do this we turn to the cross-product matrix, Table 15, formed 
with all six items. The cross products have been standardized by 
division by n?. 


TABLE 15 


SranpDaRgpizpp Cross-Propucr Matrix oF REPEATED 
Inrseview Sropyr: [#j]/n* 


oo fF NN oo & 


Attention is directed to the nine cross products (in italics) in the 
right upper corner. Each is formed between an item from the first 
interview and one from the second. It is known from Section III, 
paragraph 4, that the cross product [AB] between the two latent 
dichotomies can in the mathematically perfect case be computed 
from any of the nine combinations: 


_1 [tk] 


SiSt 7 Va (n = Na) ne (n = Np) 


(5.14) [AB] = 


where as usual the barred index pertains to an even item (second 
interview) ; the unbarred to an odd item (first interview). 

With fallible data the nine values of (5.14) will give slightly dif- 
ferent results and again a geometric mean will be used to get an 
adjusted [AB]. If we have odd and even items, then the following 
formula can easily be verified : 
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(5.15) 


log [AB] = 5°) log LE] — 5a Dy 08 (i - sqm Doe EI 
aa log Na (n =— ta) NB (n = nz) 


2 
h = number of odd items, g = number of even items 


The first sum on the right is taken over all the italicized cross 
products of Table 15 (after conversion into logarithms). The sec- 
ond and the third sums are taken in the same way over the two 
blocks of boldface figures which cover the odd and the even items 
respectively. 

In the present case, h = g = 3. Therefore, the adjusted [AB] 
becomes 


[AB] = 


aT ae 53 [12] [14] [16] [23] [34] (36) [25] [45] (56) 
ees hea */[13) (15) (35) [24] (26) [46] 


At this point a correction has to be made due to the fact that these 
six items are in fact the same three questions asked on two different 
occasions. This seems likely to produce a special relationship be- 
tween the respondents’ answers to the same question when asked 
on the two occasions, in excess of the relation due to general attri- 
butes. We shall correct for this by removing the three cross prod- 
ucts which arise from the repeated use of the same item: [12], [34], 
and [56]; for the numerator of the formula we may take the sixth 
root of the remaining six cross products. 

Applying this for our example (through use of logarithms), we 
obtain the value of the cross product of the latent fourfold table: 


[AB] = 11,640 
and therefore 


my = Manet [AB] _ 230.1 x 188.8 + 11,640 _ je, 5 


We can now fill in the four latent class frequencies: 
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First interview 
+ = 
ny NII 
+ 134.5 4.3 138.8 
Suhel (41.5%) | (1.3%) 
e nit niv 
- 95.6 89.6 185.2 
(29.5%) | (27.7%) 
230.1 93.9 324 = 100% 


This result suggests that only about four cases have changed 
their basic attitude from negative to positive, while about ninety- 
six cases have gone from a favorable to an unfavorable attitude, 
the other two groups remaining constant in their attitude. 

By contrast, the turnover on the three individual items was: 


Item 1 Item 3 Item & 
First interview Firstinterview First interview 
+ = + - + ~ 


Second + 538% 5.2% + 20.8% 6.8% + 38.6% 10.8% 
interview __ 05 60% 15.4% — 35.5% 28.1% — 25.9% 24.7% 


On individual questions, the number who went counter to the 
trend from negative to positive ranges from 5 to 10 per cent. In 
terms of the latent attribute, this ‘“backwash”’ amounted to only 
1.3 per cent. The two groups who did not change are both larger 
on the latent attribute turnover than the average of the individual 
questions. The latent attribute appears to be generally less sub- 
ject to chance fluctuations one way or the other. 

Finally, we can record the whole latent structure in Table 16 and 
return to the discussion of latent linearity. 


TABLE 16 


Latent Srructurge or RePwatTep INTERVIEW Srupy 


Latent class Latent marginals for items 

frequencies 1 8 & 2 4 6 
m = 184.5 . 964 .869 .852 .932 .778 .959 
mr = 95.6 964 .869 852 333 .054 .145 
m1 = 4.3 .373 117 136 .932 .778 .959 
nv = 89.6 .373 .117 . 136 333 .054 .145 
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6. CASES WITH MORE THAN ONE UNDERLYING CONTINUUM 


Obviously, the structure of Table 16 does not permit an arrange- 
ment of the latent subclasses as requested by the definition of 
latent linearity in paragraph 4 above. The even items have a 
bimodal sequence of latent marginals. If we were to rearrange the 
order of the latent subclasses, so as to assure a monotonic sequence 
for the even items, the odd items would have a bimodal sequence 
of the latent marginals. The fact is easily explained by the nature 
of the example: it is the result of having assumed two latent dichot- 
omies. But the finding can be reversed, and then it suggests a 
generalization : 

Suppose we had analyzed the original data blindly, merely 
guided by their manifest rank structure, and had obtained a latent 
structure scheme like Table 16. Then, two interpretations would 
have been open to us. We could have said (a) that this is a four- 
class structure equivalent to one underlying continuum; or (b) that 
it represents two latent dichotomies. In the first case, we would 
have to add that the trace lines are not unimodal. Which of these 
two interpretations is preferable could be decided only in the light 
of the subject matter under investigation. However a review of 
current test practices suggests that the second interpretation would 
be preferred in most concrete problems. As a matter of fact, no- 
tions such as “‘unidimensionality’”’ or “homogeneity” of tests seem 
well covered by our concept of latent linearity. But the fact re- 
mains that it implies an additional convention and cannot be 
derived from a basic mathematical formulation. 

To round out the picture, we shall reformulate the matter in 
mathematical terms, restricting ourselves for simplicity’s sake to 
two variables. Suppose that there were two underlying continua, 
zandz. People then could be located in an (2,2) plane. A dichot- 
omous question would have a trace surface y = f(z,z), indicating 
the probability of a positive answer at each point of the underlying 
continuum plane. The position of all respondents would be de- 
scribed by a distribution surface y = ¢(z,z). The accounting 
equations, linking this model with the manifest joint occurrences 
and response patterns, would be: 


+2 + 
(5.16) Pij- = f[ f Ji (2,2) f; (x,2)- + -(@,z)dxdz 


454 SOME LATENT STRUCTURES 


As in Section I, additional assumptions will have to be made to 
make these equations more specific and accessible to empirical 
solution. Suppose that the trace surfaces in a specific case are 
quadratic; then it would be possible to partition the (z,z) plane 
into nine rectangular sectors and to assume that in each sector the 
population is concentrated at one point. Thus we would get nine 
segmental probabilities so that: 


(5.17) Pij- = MpPupiy ++ + Mrpupiuy:-+* +: 


These equations have a solution which can be interpreted as fol- 
lows: the latent class frequencies mz, mir, - « - , M1x are the number 
of people concentrated at one point in each partition of the (z,2z) 
plane. Within each partition the responses to the test items are 
independent of each other. Each test item has in each sector of 
the plane one latent probability. The response patterns of the 
whole sample are related to these latent parameters by accounting 
equations of the type (5.17). 

But obviously the solutions of (5.17) can also be viewed as be- 
longing in the frame of reference of the discussion in Section I of 
Chapter 10. They might as well refer to one underlying continuum 
divided into nine segments. Which of the two interpretations 
mentioned should be used cannot be decided on mathematical 
grounds. But what is important is this: if it is desirable to assume 
two or more underlying continua, then the algebra developed in 
Section II of Chapter 10 is adequate for dealing with this more 
complicated structure also. 


SECTION VI 


LATENT STRUCTURE ANALYSIS AND TEST CONSTRUCTION 


In the last two sections we have chiefly developed computing 
procedures for the derivation of latent parameters from manifest 
data. As we proceeded, a number of incidental observations were 
made, especially on how the latent parameters could be used to 
bring order into the manifest response patterns. It now seems 
advisable to discuss in a more general way the relation between the 
present approach and a few topics from the field of test construction. 

From the beginning it should be clear that latent structure theory 
does not propose the use of one specific test procedure in preference 
to others. Its purpose is merely to bring out clearly and precisely 
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what seems to be common to a large variety of techniques. The 
discussion below is concerned not with deciding which of several 
procedures is best, but rather with how they are related to one 
another. 


1. THE LATENT STRUCTURE OF THE SCALOGRAM PATTERN 


A considerable portion of the present volume is devoted to the 
specific types of response patterns which since the end of the war 
have become known as the Guttman scale. It was Guttman’s 
great merit to have formulated clearly the problem of analyzing 
qualitative data by intrinsic criteria. The basie equations he de- 
veloped for this purpose do not assume any underlying model or 
latent structure. All of the solutions he found have, therefore, an 
interest of their own which would not be affected by the present 
approach. 

However, the actual response patterns which Guttman has stud- 
ied would also find their place in a systematic survey of all possible 
latent structures; the perfect Guttman scale itself is especially 
easily located. It is, in the terms of the present approach, a latent 
distance scale with d = 0. Its latent structure is given in Table 
17. (See Section III, paragraph 5, and Section V, paragraph 3.) 


TABLE 17 


Latent Srrucrure oF THE Perrscr GuTrMan ScaLe 


Latent class Latent marginals for items 
frequencies 1 2 8 4 tee 
ny 1.00 1.00 1.00 1.00 
ni 0 1.00 1.00 1.00 
mn 0 0 1.00 1.00 
fv 0 0 0 1.00 
0 0 0 


ny 


The response pattern which would be derived from such a latent 
structure is given for three items in Table 18. We see that four 
scale patterns clearly emerge. The cross-product matrix has rank 
m, corresponding to the number of items. It has the interesting 
property that the tetrads formed on one side only of the main diag- 
onal will all vanish. 

The trace lines of a Guttman scale are of a peculiar form. They 
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are straight lines identical with the z-axis up to a certain point and 
then in a discontinuous jump become straight lines with an ordinate 
of 1. Algebraically they would have to be represented by poly- 
nomials of infinitely high degree. The corresponding latent class 
structure for a Guttman scale, therefore, is one with infinitely many 
classes, if one had available what Guttman calls the ‘‘full universe 
of items.” With a limited number of such items, the number of 


TABLE 18 


Responss PatTreRNs IN THE Parrect Gurrman Scarp (3 Irmms) 


Response Number in latent class 

pattern I II Ill Iv Totals 
+++ nr 0 0 0 ny 
- + + 0 Nir 0 0 ny 
+--+ 0 0 0 0 0 
++- 0 0 0 0 0 
+-- 0 0 0 0 0 
-+- 0 0 0 0 0 
--+ 0 0 nit 0 mint 
--- 0 0 0 nrv nivy 


classes which can be formed is one higher than the number of items. 
The latent class frequencies are immediately related to the manifest 
marginals of a Guttman scale by the following series of equations: 


61) n= my, j= nu am, pe mi + a + mur 


etc. 


The perfect Guttman scale has perfect precision in the sense of the 
previous section. This can immediately be seen from Table 18. 
The respondents who give one of the four admissible response pat- 
terns recruit themselves uniquely from one of the four latent classes. 
In a perfect scale, therefore, probability considerations do not enter. 

As Guttman points out, empirical data will never furnish a per- 
fect scale in his sense. To measure the degree of perfection, he has 
introduced the notion of reproducibility. It is important to realize 
that this corresponds to the notion of ‘‘fit” and not to the notion of 
“precision” in latent structure analysis. The structures we have 
discussed in the previous two sections, even if they are mathemati- 
cally perfect, always misclassify a number of people as to their lat- 
ent class position : they all have a precision of less than 100 per cent. 
In addition, they have imperfect fit; this means that the actual 
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ultimate frequencies are somewhat different from those which 
would be generated by the corresponding latent structure. The 
perfect Guttman scale would misclassify no one, but, of course, an 
empirical Guttman scale has a certain misfit. If it were perfectly 
fitted, no one would give a response pattern other than the main 
scale types. 

It is possible to compare the fit of any latent structure with that 
of any other. We can always compute fitted ultimate frequencies 
and compare them with the actual ones. But this cannot be done 
with Guttman’s coefficient of reproducibility, which can be applied 
only to the deviation of fallible data from the latent structure of a 
perfect Guttman scale. Take as an example the latent distance 
case which we have studied in Section V, paragraph 3. The d-values 
of the four items are, on the average, less than 10 per cent. The 
fit is very close as can be seen from the comparison of the last two 
columns in Table 13. The discrepancy between the actual and the 
“fitted” ultimate frequencies is 50.8 cases. Suppose we desired to 
find how well the same actual data would fit a perfect Guttman 
scale. Table 19 isa recasting of Table 13 into terms of the Guttman 
scalogram. Errors in Guttman’s sense are boldfaced. To obtain 
the coefficient of reproducibility, the errors are summed and divided 
by the product of the sample size and the number of items in the 
test; this quotient is then subtracted from 1.00. In this case the 
coefficient of reproducibility is 94.0 per cent, a fairly high value. 
Now, if we apply our standard notion of misfit to a Guttman scale, 
we have to compute the “fitted” ultimate frequencies which a per- 
fect Guttman scale with a given set of manifest marginals would 
generate. From equations (6.1) we get the latent subclasses. 
Inserting them in Table 18 (extended to four items), we get the 
full latent structure. With the usual procedure we get the fitted 
ultimate frequencies which were inserted as the last column in 
Table 19. The difference between the last two columns gives a 
measure of fit, which makes the Guttman scale comparable to other 
latent structures. The discrepancy is 532 cases out of 1,000. 

In turn, if one were to make a x? test of goodness of fit, one would 
have to remember that the perfect Guttman scale with 4 items has 
only 5 parameters—the 5 latent class frequencies—while the latent 
distance scale has 9 parameters, one more for each item. There- 
fore, a higher misfit in the Guttman scale can be accounted for by 
chance. It also should be remembered that the precision of a 
Guttman scale is always perfect. 
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Guttman has also introduced the notion of a quasi scale. While 
he has not yet given it a precise mathematical formulation, it is 
reasonable to assume that he would consider all the structures which 
have latent linearity as quasi scales. In this sense, one might say 
that the bulk of the work done in latent structure analysis is de- 
voted to the classification and study of quasi scales. 

The scalogram procedure by which Guttman studies the suitabil- 
ity of an item for a Guttman scale can be described in terms of trace 


TABLE 19 
Scatocram ANALysis oF Latment Distance Data or TaBLe 13 


Scalogram 

Response oc COTO O Actual Fitted 
pattern 1 2 8 4 1 2 8 4 total total 
t++eet4 xX xX xX X 75 138 
++-—-+4+ x XxX x x 10 0 
+++- xX xX X x 8 0 
+-+4+ xX xX XX x 14 0 
-+t++t+ xX XxX X x 110 169 
-+t+—- x XX xX x 11 

--t++t+ xX xX xX xX 141 124 
shit x x x x 49 0 
+--+ x xX xX XX 11 0 
eeecn ets x xX jxX XX 161 140 
t++-- x xX Xx 3 0 
+-—-+- x x x 8 0 
She >.< x 64 0 
Sa x x xX Xx 41 0 
ie = x xX XX X 9 0 
Ste x xX xX x 285 429 


lines (an observation made to the writer by S. A. Stouffer). The 
respondents are ordered first with the help of a preliminary score, 
say, the number of items they endorse. This corresponds (though 
very crudely, perhaps) to classifying respondents according to their 
latent class position, in as much as this preliminary score is likely 
to correlate rather highly with the underlying continuum. Then, 
for a Guttman item, the following criterion is applied: up to a cer- 
tain value of the score almost everyone should endorse the item. 
Beyond this point, presumably, no one should endorse it. In other 
words, the item should show the type of discontinuous trace line 
previously described. Now nothing prevents one from using the 
scalogram technique to approximate trace lines of other types. One 
might request, for instance, that the proportion of people endorsing 
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an item should go down fairly proportionately with the progress of 
the general score. Or, for a middle item in a Thurstone scale, one 
would request that the endorsements be highest in the middle and 
go continuously down toward the two ends of the general score. 
Stouffer’s recently developed idea has been applied to negatively 
and positively worded questions included in the same test. The 
scalogram lines show clearly the concave and convex forms required 
by general trace line considerations. Further work along this line 
has been started. 

A final remark can be made regarding the type of items which 
are likely to fit into a Guttman scale. Let us first distinguish be- 
tween two idealized extreme types of questions, which we shall call 
“projective” items and ‘“‘yardstick” items. The essence of the 
first type is that the respondent is not aware of the purpose of the 
question. Suppose we show pictures of two men which look rea- 
sonably alike and give the one an Italian and the other an Anglo- 
Saxon name. Then, let us ask a sample of people which of the two 
looks nicer. If we find certain groups of respondents who consist- 
ently dislike the man with the Italian name, we will usually accept 
such a question as a useful test of prejudice. We would expect the 
trace lme of such an item to be a rather smoothly ascending curve. 
The more prejudiced a person is, the more probable is his dislike 
for the picture of the man with the Italian name. Take, on the 
other hand, as an example of a “yardstick” item, a question which 
contains a term such as ‘Do you dislike very much . . .?” or any 
other term which connotes explicitly a position along the attitude 
continuum. The trace line of such a question is likely to make a 
sharp ascent close to the point of the underlying continuum which 
corresponds to the term used. Think, for instance, of a test de- 
signed to order respondents on a capitalistic-socialistic continuum. 
Suppose that the following two questions are included: 


1. All economic systems have their advantages and disadvantages. 


Agree Disagree 
2. Only a fool can have an opinion on a whole economic system. 
Disagree 


Both of these questions are likely to have bell-shaped trace lines; 
that is to say, they will be most frequently endorsed in the middle 
of the underlying continuum. But we would expect the second to 
have a sharper peak than the first (preliminary experiments have 
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shown that this assumption can actually be verified by the study 
of joint occurrence patterns). It is reasonable to expect that items 
fitting into a Guttman scale will be very much of the “‘yardstick”’ 
type. As a matter of fact, the mathematical conditions for a per- 
fect Guttman scale are probably equivalent to the requirement that 
the admissible items do not contain any “projective” element. 

In addition to this nonprojective character of the wording, Gutt- 
man items also have to be cumulative. Bell-shaped trace lines are 
obviously excluded. It might be mentioned that, as a consequence, 
the cross-product matrix of a Guttman scale can always be made to 
contain only positive values. This is not the case for the cross- 
product matrix of a Thurstone scale because middle items are 
admissible. 


2. THE “TRACE LINE SCORES” OF RESPONSE PATTERNS 


In previous pages, occasional remarks have been made about the 
scoring of response patterns. It remains now to discuss more sys- 
tematically the relation between latent structure analysis and the 
scoring of tests the latent structure of which has been investigated. 
It is not the purpose of the present chapter to carry such a task out 
in full ; this is left for a subsequent text in preparation for the RAND 
Corporation. But the preceding discussion would remain too in- 
complete if some of the main considerations were not summarized. 

It will be remembered from Section I that in the general model 
there corresponds a trace line y = f,;(x) to each item 7. To the 
joint occurrence of items, say, 1 and j, there corresponds a trace line 
y = fi;(x). It is quite obvious from the previous discussion that 
every response pattern has its trace line. The response patterns 
(+ —) and (— +), for instance, would have respectively the trace 
lines: 


y = fiz) (1 — f,(x)] 
and 


y = f(x) LL — fi(x)] 


The meaning of these trace lines should be quite clear by now: they 
give for every point on the underlying z-continuum the probability 
that just this response pattern can be expected; or, in other words, 
the proportion of people who give this response pattern among all 
those who have the given z-value. It is easy to generalize the dis- 
cussion of Section IV, paragraph 5, in terms of trace lines. Assume 
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for a moment that the trace lines are fully known. If a person then 
gives a certain response pattern, we still do not know his precise 
position on the underlying continuum. But we know for each 
point on the z-axis how great the probability is that such a respondent 
has his position at that point. And this is all we can ever know from 
the point of view of the present theory. Any score we might de- 
velop would only summarize this kind of probability distribution. 
To illustrate this idea we will address ourselves to the simplest but 
most basic scoring problem conceivable: how to decide which of the 
two response patterns (+ —) and (— +) should have the higher 
score. 

To be numerically concrete, we shall assume that a latent struc- 
ture was found with linear trace lines. The latent class frequencies 
were: n; = 600, m1: = 400. Two items in the test showed the 
following latent marginals: 


TABLE 20 


Scuematic EXAMPLE FOR ORDERING REesPonsE PATTERNS 


Latent class Latent marginals for items 
frequencies 1 2 
nm = 600 .90 -70 
nr = 400 .20 -30 


We know then from Section III, paragraph 2, that the manifest 
marginals of the two items are: 


pi = — = .60 X .90 + .40 X .20 = .62 


= .60 X .70 + .40 X .30 = .54 


3 
A) 
I 


m1 
n 
2 
n 


We also know from the same discussion that 


a io 600 X 400 _ 
= (90 — 20) ae = 34 
and Ss = (.70 — .30) sarenyag x ai 20 


Now we shall actually draw the linear trace lines for these two 
items. From Section I, paragraph 5, we know that in the linear 
case the slope of the trace lines is S; and their intersection with the 
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y-axis p;, provided we choose a frame of reference so that its origin 
is at the mean of the distribution curve and the units on the z-axis 
are so chosen that o = 1. 

In this discussion we also need the trace lines for the negative 
replies to an item, which obviously have the equations: 


y=1-— (p+ Sz) =¢q: — Sax 


where g; = 1 — py. 

In Figure 2a we draw the positive trace line for item (1) and the 
negative for item (2), corresponding to the pattern (+ —). Added 
is the trace line for (+ —) which is 


y = (.62 + .34z) (.46 — .20z) = .29 + .042 — 072’ 


In Figure 2b we draw the negative trace line for item (1) and the 
positive for item (2), corresponding to the pattern (— +), the 
trace line of which is also inserted. Its equation is 


y = (.88 — 34x) (.64 + .202) = .21 — lle — .072? 


Now we shall raise the question which is at the core of all scoring 
problems. Given two items like the ones in the present example, 
no one will doubt that (+ +) is the most positive reply and (— —) 
the most negative. But in what order should (+ —) and (— +) 
be inserted between the two? Is (+ —) more or less positive than 
(— +)? The intuitive inspection of Figures 2a and 2b will fur- 
nish the strong suggestion that (+ —) (2a) should have the higher, 
more positive, score. Its trace line lies more to the positive side 
than the trace line of (— +-) (2b). 

There are a variety of ways in which this intuitive notion of the 
position of the trace lines could be formalized into an index. Two 
of them deserve special attention: the maximum probability of the 
response pattern and the expected value of a respondent’s position. 

To compare the highest probabilities of our example in a general 
way, we have to compare the position of the maxima in the follow- 
ing two functions: 


(6.2a) (+ —): y = pigs — 2(piS: — q2Si1) — 27SiS: 
(6.2b) (— +): y = poh — x(peSi — gS2) — 27SiS2 


The roots of the first derivatives are respectively : 
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Figure 2a (top) and Figure 2b (bottom). 
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—n , & 
z max for (+ —) = 28, + 2S, 
(6.3) 


z max for (— +) = ae tis 


This we shall call the maximum probability score (MPS). The rela- 


tive position of (+ —) or (— +) depends on the sign of the dif- 
ference: 


(qoSi1 — prSe = (qiS: — p2S:) 
281S2 281S2 


Remembering that g = 1 — p, this difference equals SoS) oF Se. 
1wW2 


We therefore have the result that (+ —) will have a higher 
MPS than (— +) if S, > S:. It can easily be seen that this 
result can be generalized: if S, > S:, then any response pattern 


(- + +--+ —) will have a higher score than the corresponding 
pattern (- - - — +). More complex comparisons like, e.g., 
(+ + — —) versus (— — + +) can be decided by a similar rea- 
soning. 


Now, let us see what happens if the expected value is used as a 
score. People are distributed along the x-axis according to a dis- 
tribution curve, ¢(z). If, therefore, a respondent is found to give 
a response pattern with the trace line, say, f(z), the probability 
that he comes from a position z is fx(z)@(z). This is a case where 
the classical theory of inverse probability applies. The respondent’s 
expected position, therefore, is 


+ eo 
(6.4) f afralz)o(z)dz 


Calling this the expected value score (EVS) and applying it to (6.2a), 
we obtain, e.g., for (+ —) 


nam a (piS2 — q251) M® — S:S.M® 


In our frame of reference, M“ = 0 and M® = 1. M®, in this 
case, as we know from (1.8), is K, the computation of which was 
discussed in detail in Section IV (see especially paragraph 3). The 
two expected values are therefore: 
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(EVS) for (+ —) = SiS (% - # x) 
(EVS) for (— +) = SiS: u _B _ x) 


The difference between these two score values is (Si — S2) which 
is essentially the same as for the (MPS) values previously derived 
(6.3). With linear trace lines, then, both types of scores would 
lead to the same ordering of response patterns. With more items 
and with higher degree trace lines, the relationships and computa- 
tions become considerably more complicated. But the basic ap- 
proach to scoring remains the same: no notion of “position of 
items” or ‘‘position of response patterns’ is introduced into the 
model: it provides only for the notion of ‘‘position of people.” The 
score of a response pattern is the average position of the people who have 
answered the test in this specific way, expressed as an (MPS) or an 
(EVS) or any other average. 

It is clear that not only response patterns but also single items 
have scores. If the only information we have about a respondent 
is that he has answered item t positively we would have to put his 


expected position at af ;(x)p(x)dx, which turns out to be 


p:M“ + S,o with linear trace lmes or in the simplified frame of 
reference just S;, a value which played such a prominent role in all 
computations of Sections I and II. With higher degree trace lines 
similar values can be derived. 

It is worth noticing that one could in the same way derive more 
complicated indices, e.g., the dispersion of a response pattern 


measured by x*f(x)(x)dz, and so on. 


-« 
3. LATENT CLASSIFICATIONS AND TEST CONSTRUCTION 


We have thus a precise definition of a “score” in terms of a trace 
line model. In case we can derive the distribution function $(z) 
from manifest data we then can carry out all necessary computation. 
This has been done successfully for the linear case. We know that 
K is the skewness of ¢(z). Working with four items we can derive 
the following formula: 


8 In the present paragraph, the symbol js refers to the traditional statistical definition 
of the kth moment about the mean; it is not to be confused with the definition of a link, 
i, used elsewhere in Chapters 10 and 11. The symbol M® refers to the kth moment 
about the origin. 
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A 
4j.kt * a 
where 


Aijar = [i7;kl] — wa we) 


This means that we then also know the kurtosis of ¢(x). By add- 
ing terms we can derive higher and higher moments of ¢(x) and 
therefore compute the expected value score for more complex re- 
sponse patterns; and only origin and unit are arbitrary in those 
scores. 

If we set 
(6.6) pjsed] = MLB) Bat 


the relationship (6.5) simplifies to 8. = K? +1. The identity (6.6) 
is not new to the reader of Chapter 10. It is the third condition 
necessary for the reducibility to two latent subclasses (see Section 
II, paragraph 7). The corresponding relationship between the third 
and fourth moments about the mean is not new to the mathemati- 
cian. Take the following distribution function: the population is 


; y v 
concentrated at two points: z11 = — 4/—- and z; = 4/—!; there are 
Vir Vy 


vir and vy; people at the two points respectively, with v1 + vn = 1. 
Such a distribution has a mean of zero, a standard deviation of 1 
and py = ps? + 1. 

This example permits us to see more clearly the relation between 
the general trace line model and the equivalent latent class struc- 
ture.? The latter turns out to be a special case of the former, 
originating from a specific assumption about the distribution curve 
¢(z). Obviously other assumptions can be made, or none at all. 
The discussion of scoring in the linear case, carried out in paragraph 
2 of this section, was valid for any kind of ¢(z). The consequences 
of various assumptions about ¢(z) and their relation to manifest 
data are now under investigation. The assumption of a discon- 
tinuous concentration of the population at a few points—mathe- 
matically equivalent to the existence of latent subclasses—has spe- 
cial bearing on test construction and needs some further discussion. 


* The writer wishes to thank Professors Mosteller and Savage for helpful discussions 
on this point. 
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In attitude studies we are not only interested in minute differences 
between persons as we would be, e.g., in aptitude measurement. 
Volumes I and II of the present series use large numbers of attitude 
tests, which rarely if ever classify respondents into more than a few 
groups. This is probably due not merely to lack of precise instru- 
ments of measurement. A reflection upon the nature of attitudes 
shows that it is often quite realistic to think in terms of a few broad 
classes. People certainly vary according to whether they are 
radical or conservative, whether they are prejudiced toward Ne- 
groes or not; and from day to day, from one mood to the next and 
from one social situation to the next, their attitudes usually vary 
greatly.° These variations, however, are likely to stay within cer- 
tain bounds: a soldier friendly to Negroes might occasionally be 
ruder to a Negro than to a white man, but he will not easily par- 
ticipate in a lynching. These observations are not meant to start 
a precise analysis; they merely should suggest that there is nothing 
disquieting in thinking of broader classes of attitudes, ‘‘natural”’ 
segments in an attitude continuum. 

It is, of course, possible to achieve such broader classifications by 
combining the scores of any attitude test in arbitrary intervals. 
But the present approach seems to indicate that a given set of em- 
pirical data has, so to speak, its most appropriate segmentalization. 
It is the one which first assumes the degree of trace lines indicated 
by the rank conditions of the joint occurrences of second and third 
order and then ‘‘forces’” the higher order and especially the ulti- 
mate frequencies into the “nearest’’ latent class structure. 

The procedure for a concrete study as suggested by our analysis 
would then be as follows: the first question to raise is how many 
attitude groups are needed to classify our respondents. Suppose 
we decide on four groups. Then, we will want to pick such test 
items as are likely to have polynomial trace lines of third degree. 
It was mentioned before that there is a connection between the 
wording of questions and their probable trace lines. A consider- 
able amount of experimental work will be necessary to establish 
this connection so clearly that it can be used in daily practice. 
This writer, however, can testify that it is not too difficult to make 
a pretty good guess as to the rank condition of response patterns 
from a careful study of question content. The final decision, of 
course, in every concrete test construction will have to be made in 


1° Evidence on this point will be published in a monograph by Patricia Kendall on the 
use of repeated interviews in the study of attitudes. 
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the light of actual data and therefore one will always want to start 
with more items than are finally needed in the test. Once a test 
has been found which satisfies the conditions for reducibility to 
four latent subclasses, the next step is implied in our analysis. We 
can compute a latent structure and put people into that one of the 
four classes into which they most probably belong, according to 
their response patterns. 

This procedure is likely to be greatly facilitated by a theorem 
which Louis Guttman has developed."! According to this theorem, 
the frequencies of response patterns with bimodal latent structure 
tend to become smaller as additional items of the same rank struc- 
ture are added. In response patterns with unimodal latent struc- 
ture, the highest probability tends toward 1 and the other proba- 
bilities toward 0 as more items are added. This means that it is 
possible to get as near to a precise latent classification as is desirable 
for the problem at hand. Or, to put it in other words, with in- 
creased number of items, the expected position of most of the re- 
spondents will move ever closer to one of the concentration points 
of ¢(x). The expected value score will indicate which point each 
response pattern is likely to approach. The simplest way to form 
the score in this case is to assign to the ordered set of latent classes 
a convenient series of weights and thus compute the averages of 
the latent probabilities. 

The discussion on scoring developed in paragraph 2 above thus 
applies to the equivalent latent structure as well as to the general 
trace line model. As long as our tests have only a limited number 
of items, the question will remain as to which scoring system is pre- 
ferable. If a specific set of fallible data makes it equally reasonable to 
fit a latent class structure or a continuous distribution curve, which 
would be more advantageous? In this writer’s opinion, the answer 
would be about as follows: 

If the material indicates the existence of linear trace lines, then 
much speaks in favor of a continuous model. The distribution 
curve can actually be computed ; the scores of the response patterns 
are unique and permit for all practical purposes a metrical interpre- 
tation; the relationships to other variables can be studied more 
carefully; comparisons between groups and between interviews 
taken at different times can be made in considerable detail. It is 
also easier to find items which satisfy the two conditions for linear 


1 From a written communication by Professor Guttman. 
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trace lines if one does not also require the additional condition for 
the existence of a latent dichotomy. 

But there are many research situations where linear trace lines 
are not desirable nor feasible. The former will be the case if “neu- 
tral items’’ are used of the type characteristic of a Thurstone scale. 
Linear trace lines are unlikely to exist when large groups of ‘don’t 
know”’ answers occur in the manifest material; or when the “nat- 
ural’? cut of the dichotomous questions shows a great variation. 
Once the material requires nonlinear trace lines, then it is undoubt- 
edly better to assume, wherever the data permit, a discontinuous 
¢(z). The computations are easier and the possible interpreta- 
tions of latent classes are more helpful than they are with distribu- 
tion curves which have some of their moments (in addition to the 
mean and oc) undetermined. 

Once the assumption of a discontinuous ¢(xz) has been made, it 
is, strictly speaking, no longer legitimate to use an expected value 
score (EVS) for each separate response pattern as we did in connec- 
tion with our example in paragraph 2 above. For then one puts 
the expected position of a respondent on a point where he could 
not be, according to the discontinuous assumption. But it seems 
that an EVS in this case would lead to an ordering rather similar to 
the one which would have been derived from the continuous as- 
sumption and with much less computational labor. Therefore, 
our EVS in the case of a latent class structure analysis seems per- 
missible as a substitute device. But the most adequate use of the 
latent structure is to put the respondents into the latent class where 
they most probably belong, perhaps leaving unclassified the cases 
with very similar latent probabilities in more than one latent sub- 
class. 


4, RELATION OF LATENT STRUCTURE ANALYSIS 
TO FACTOR ANALYSIS 


The present theory has a number of similarities to and differences 
from factor analysis. 

The greatest similarity is in regard to logic. Much of what 
Thurstone in his latest book has said about the role of mathematical 
models applies to both approaches. As a matter of fact, the pres- 
ent system has developed from a desire to adapt factor analysis to 
qualitative material. 

Previous writers have applied factor analysis to dichotomies in 
the following way: fourfold tables were formed between all the 
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pairs of items and then so-called correlation coefficients were at- 
tached to these fourfold tables. Once they were accepted and 
entered into a correlation matrix, the whole computing mechanism 
of factor analysis could of course be applied. But this seems to be 
an unjustifiable procedure, as can easily be seen by the fact that 
different results were obtained according to whether one worked 
with tetrachoric coefficients or the Pearson point correlations.'* 
The present approach avoids this difficulty by using the formula- 
tions of factor analysis in such a form that the notion of correlation 
is never needed. The only concept which is needed mathematically 
is the notion of independence of variables, which has a unique 
meaning for dichotomies. 

As a mathematical result the whole theory can be based on the 
analysis of joint occurrence matrices. But as has been seen repeat- 
edly, it is possible to base it also on a good analogy to the correla~ 
tion matrix: the cross-product matrix. This would make it appear 
that the present theory is nothing else than factor analysis where 
the form [%j] plays the role of a correlation coefficient. 

But this is not so, as can be shown in the simplest case of one 
factor. If in a system of tests all pairs of tests have a partial cor- 
relation of zero when one factor is partialled out, then the correla- 
tion matrix has a rank of one. How about a system of dichotomies, 
the interrelationships of which can be explained by one latent char- 
acteristic? The [%j] matrix has a rank of 1 if the latent character- 
istic is a dichotomy. It has a rank of 2 if the latent characteristic 
is a trichotomy, etc. Why does not segmentalization of the factor 
appear in factor analysis? 

As far as this writer can see at the moment, this is due to the fact 
that factor analysis assumes a linear relationship between the fac- 
tor and the test. This would correspond to a fully degenerate latent 
structure with rank 1 for the ensuing [77] matrix. If curvilinear 
relationships between the test and the factor are admitted, then 
even one factor might lead to correlation matrices with higher rank. 
The analogy here is with polynomial trace lines of higher degree. 
The latent structure of a latent trichotomy, e.g., may show a 


18 8ee George A. Ferguson, ‘The Factorial Interpretation of Test Difficulty,” Psy- 
chometrika, Vol. 6, No. 5 (October 1941), pp. 323-329; and R. J. Wherry and R. H. 
Gaylord, “Factor Pattern of Test Items and Tests as a Function of the Correlation Co- 
efficient: Content, Difficulty, and Content Error Factors,’’ Psychometrika, Vol. 9, No. 4 
(December 1944), pp. 237-244, for discussion of the difficulties ensuing from such a pro- 
cedure. 
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curvilinear relationship between the item and the underlying con- 
tinuum. 

The matter can be put in still another form. Factor analysis 
deals only with zero-order correlations. In Pearson’s correlation 
theory, the higher-order correlations do not add any new informa- 
tion; they can be computed from the zero-order correlations. In 
the present approach, M-matrices of higher signature are used. 
Joint occurrences of three or more items are empirical data and not 
mathematical derivatives from joint occurrences between any two 
items. 

The following question may be raised and offered for further in- 
vestigation: suppose a factor analyst has obtained the score values 
of a number of tests. Instead of computing the correlation coeffi- 
cients between the scores, he could dichotomize all his test results 
so that he would end up with just fourfold tables between the tests. 
He would feel that he had thrown away a great deal of information. 
But has he really? Suppose now he does not proceed to tack a 
correlation coefficient upon these fourfold tables but proceeds to 
tabulate the higher-order joint occurrences of his dichotomies up 
to the point where he can carry on a latent structure analysis. 
Where would he come out? He would know in the end the latent 
marginals of his tests which would approximately correspond to 
factor loadings. And, if his latent structure has three or more 
subclasses, he would have a pretty good idea as to the actual shape 
of the regression curve between test and factor. It should be re- 
membered, however, that for such larger \, equations of very high 
order would have to be solved, which would entail a great deal of 
work. As a matter of fact, as was pointed out before, a way to 
solve these equations generally has not yet been developed. But 
in the age of electronic computing machines, it should be possible 
for students competent in this field to tackle the basic matrix 
equations and to furnish us with manageable procedures. 

It seems then that for dichotomies the present approach forms 
the appropriate parallel to factor analysis. To suggest that it also 
be applied to quantitative material is at present a mere speculation. 
But if some examples were carried out on material which has been 
factorized before, clarification might be gained. But one need not 
be restricted to empirical studies. The following mathematical 
extension of the present approach deserves serious attention. 

Suppose the manifest data are not given as dichotomies but as 
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trichotomies. In Section III, paragraph 3, we discussed what hap- 
pens if the 3 steps are “collapsed” to a dichotomy. We could also 
keep them separate. For any two items, this would give a “nine- 
fold’”’ instead of a fourfold table. Each of these 9 combmations 
would have a trace line. We could set up accounting equations 
just as we did in Section I, only for m trichotomous items we would 
now have 3” such relationships between manifest and latent data. 
It can be easily shown that all the algebra developed in the preced- 
ing chapter can be directly applied. Suppose we speak in terms of 
answers ‘‘Yes,” “No,” and “Don’t know.” First, we would con- 
sider the answer “Yes” as a positive occurrence and compute the 
corresponding latent marginals. Then the same would be done for 
the answer “‘No.” The latent marginals for the answer ‘Don’t 
know” would come out by subtracting the first two sets from a 
total of one. The amount of labor would be greatly increased, 
especially because additional adjustments would become necessary. 
Furthermore, it is very dubious whether much would be gained by 
this complication; but no new theoretical element or difficulty 
would be added. 

Now, extend this idea not to 3 steps for each item, but to many, 
say, w; for item7. Then we would be in the situation of “‘continu- 
ous” variables with w; intervals. Each combination of test scores 
would furnish one accounting equation. At this point a reformu- 
lation of the basic algebra would be necessary, and no work has yet 
been done in this direction. But this extension, if it turns out to 
be mathematically feasible, would lead to a general factor analysis 
of which the Thurstone tradition would form the continuous linear 
case; latent structure analysis would be its specific application to 
dichotomous data without restriction as to linearity. The latter 
approach, restricted as it is, still has many tasks waiting for the 
mathematician as well as for the empirically minded social scientist. 
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CHAPTER 12 


TWO CASE STUDIES IN PREDICTION: 
INTRODUCTORY COMMENTS'* 


NE of the objectives of social science is to develop techniques 

for prediction which will enable the user to predict more 

efficiently than if these techniques did not exist. By more effi- 

ciently we mean that the percentage of correct predictions is higher, 

on the average, than would otherwise be obtained for the same ex- 
penditure of effort. 

The Research Branch frequently in its surveys was in the position 
of having to make informal or implicit predictions. For example, 
the interpretation of any observed relationship between two vari- 
ables usually carries with it the implication that the association 
found at a given time and place will be found at some future time 
and place. In some cases, especially when controlled experiments 
have been used as in the data constituting Volume III of this series, 
the results carry the additional implication that if one manipulates 
X, one can expect a change in Y—in other words, that X can be 
regarded now and in the future, as a “cause of Y.” All too often 
to satisfy the scientific consciences of members of the Research 
Branch, predictions forced by military exigencies had to be made 
when the data were quite inadequate. 

Various efforts at prediction have been mentioned in Volumes I 
and II. One of the more ambitious, and also one of the less scien- 
tific, was the elaborate forecast reported in Volume II, Chapter 12, 
of general psychological conditions to be expected with the defeat 
of Germany. Much of the content of the predictive memorandum 
quoted there in full deserves to be classified as “enlightened com- 
mon sense” rather than as a forecast directly arising either from 
scientific principles or from data permitting actuarial induction. 
Yet some principles were used and available data were taken into 
account as far as possible—and, with exceptions duly noted in the 
chapter, the predictions hit the mark fairly well. 

1 By Samuel A. Stouffer. 
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Because the improvement of methods of prediction is so impor- 
tant in social science, it seemed worth while to present at some length 
in the present methodological volume two detailed case studies of 
prediction in which an explicit effort was made not just to forecast 
what certain general tendencies would emerge, but actually to 
specify the proportions of men who could be found later to behave 
in such and such a fashion. 

First, in Chapters 13 and 14, by Shirley A. Star, we shall take the 
reader through the stages in the construction of a pencil and paper 
test for screening psychoneurotics. The final form of this test was 
used routinely in the last year of the war at all induction stations 
in the United States. 

Second, in Chapters 15 and 16, by John A. Clausen, we shall 
examine the problems involved in predicting what kind of employ- 
ment men would seek upon discharge from the Army, as well as 
how great the demand would be for further schooling. 

These studies should be instructive for the student of social sci- 
ence. They are by no means models of clean and simple predic- 
tion. The complexities of the problems and the ambiguities of the 
criterion situations called for much technical ingenuity and, in the 
final analysis, for the exercise of some enlightened guesswork. In 
both instances, the Branch felt that the value of the results more 
than justified the effort. Many pitfalls were anticipated and suc- 
cessfully evaded, others were not. By a frank recording of these 
accounts of prediction, it is hoped that future students can help 
equip themselves to do analogous tasks better. 

As is so often the case with problems of prediction, the major 
difficulty lay in the lack of clarity as to the criterion situation. If 
one is using a poll to predict a presidential election, one has a simple 
and definite criterion in the number of votes actually cast for each 
candidate at the presidential election. If one is predicting, how- 
ever, on the basis of test scores, which men psychiatrists will reject 
for the Army after a psychiatric examination at induction stations, 
it turns out that the criteria are variable indeed. 

In Chapters 13 and 14 a study is reported of over 100,000 men 
constituting the entire population examined at all the induction 
stations in the United States in August 1945. For each man we 
have data on his score on the Neuropsychiatric Screening Adjunct 
(NSA), i.e., the screening test developed by the Research Branch for 
the Surgeon General, and on the disposition made by the examining 
physicians. There we see a rather astonishing fact—that, although 
the test scores had much the same frequency distribution through- 
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out the United States, the proportions of men rejected on psychia- 
tric grounds varied all the way from 0.5 per cent at one induction 
station to 50.6 per cent at another. 

Not only did the psychiatric rejection rate fluctuate from one 
induction station to another, but so also did the reasons for psy- 
chiatric rejections. If, for each station with fifty or more psychia- 
tric rejects, we set all psychiatric rejects equal to 100 per cent, we 
find that the proportions among these psychiatric rejects classified 
as psychoneurotic varied from 2.7 per cent to 90.2 per cent, while 
the proportions classified as psychopathic varied from nothing up 
to 81.3 per cent. Such variations may seem almost fantastic, but, 
of course, one must remember that psychiatry is still far from an 
exact science, that the number of competent psychiatrists is not too 
large, and that the time which physicians could devote to the 
examination of any one man was extremely limited. 

Variations were especially large in the South, as might be ex- 
pected due to lack of trained psychiatrists. Thus at the induction 
station in New Orleans, 74.6 per cent of the psychiatric rejects were 
found to be psychoneurotic, but only 22.7 per cent were at Okla- 
homa City. Only 2.2 per cent of the psychiatric rejects in New 
Orleans were found to be psychopathic, as compared with 70.9 per 
cent in Oklahoma City. But such variations were not confined to 
the South. In Boston, 57.9 per cent of the psychiatric rejects were 
found to be psychoneurotics; in New Haven only 25.0 per cent. 
In Chicago the figure was 71.8 per cent; in Detroit, 31.5 per cent. 
In San Francisco the figure was 69.1 per cent; in Seattle-Portland 
the figure was 22.9 per cent. 

The scoring of the screening test was so designed that about a 
third of American young men, on the average, would manifest signs 
indicative of the need of very careful psychiatric examination. 
This does not mean, as is fully explained in the next two chapters, 
that all these men should have been rejected. Rather, the cutting 
point was set high enough to include among the one third practically 
all of the psychiatric cases, as well as some who, upon careful ex- 
amination, should be accepted by the Army. Part of the predic- 
tion problem was to find whether or not this one third did include 
most of the eventual psychiatric rejects. 

Actually, as is shown in detail in Chapter 14, this one third 
screened by the test included 80.8 per cent of the psychoneurotic 
rejects, 68.2 per cent of the psychopathic rejects, 70.8 per cent of 
the psychotic rejects, and 56.4 per cent of all other psychiatric re- 
jects. All told, it screened 69.5 per cent of all psychiatric rejects. 
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At the same time, it screened, as cases also worth examining, 21.8 
per cent of the men eventually admitted and 30.3 per cent of the 
men rejected by the physicians on other than psychiatric grounds. 

In spite of the very great variability from one induction station 
to another in psychiatric diagnosis, there was no station with as 
many as fifty psychiatric rejects in which the test did not screen at 
least half of those eventually rejected for psychiatric grounds, and 
in all but four of these stations the proportions screened were be- 
tween 60 per cent and 96 per cent. 

The tables shown in Chapter 14 have, therefore, an importance 
transcending a mere exercise in methodology. The variability of 
psychiatric diagnosis is not exactly news to psychiatrists, but until 
this study was made there was no uniform scoring device against 
which the variability throughout the United States could be com- 
pared. Hence, if there was three times as high a psychiatric rejec- 
tion rate in one city as in another, there was no answer if somebody 
chose to argue that maybe one city did have three times as many 
psychiatric cases in proportion to its population as another. 

How the Research Branch data throw light on such discrepancies 
may be illustrated in detail by a comparison of data from the Chi- 
cago and Detroit induction stations: 


Chicago Detroit Number of cases 
Proportion rejected on psychiatric 


grounds among all men examined 78% 21.6% 4,628 
Proportion screened by NSA as 
worth careful examination 24.1 26.9 4,236 


We see that, in spite of the fact that the test scores showed about the 
same proportion in the two populations as needing careful examina- 
tion (24.1 per cent in Chicago and 26.9 per cent in Detroit), the 
Chicago psychiatric examination rejected only 7.6 per cent while 
the Detroit examination rejected nearly three times as many. Let 
us continue: 


Chicago Detroit 
Psychiatric diagnosis of 
all psychiatric rejects: 
Psychoneurotic 71.8% 31.5% 
Psychopathic 10.7 39.0 
Psychotic 1.2 _ 
All others 16.3 29.5 


100.0% 100.0% 
Total number of rejects S44 916 
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Not only did Detroit reject three times as many as Chicago, but also 
among the rejects the “causes” of rejection were very different. 

Because the distribution of test scores at the two stations was - 
about the same, yet the psychiatric diagnoses were so different, one 
cannot expect to find the test very efficient in predicting psychia- 
trists’ behavior. We have the following findings: 


Chicago Detroit 
Proportion screened by NSA as 
worth careful examination, among: 

Men later accepted for military service 19.5% 15.38% 
All psychiatric rejects 64.6 60.9 
Psychoneurotic rejects 69.0 88.8 
Psychopathic rejects 62.2 58.2 
Psychotic rejects 75.0 — 
All other rejects 42.9 35.3 


There can be no doubt that the test tended to screen a much larger 
proportion at each induction station among those subsequently re- 
jected on psychiatric grounds than among those accepted. That 
the discriminating power of the test was as effective as these data 
show is all the more noteworthy in view of the widely divergent 
standards of psychiatric diagnosis, acceptance, and rejection which 
it seems probable were applied at the two stations. 

The same kind of discrepancies, as between Chicago and Detroit, 
are also seen all over the United States, as the tables in Chapter 14 
vividly portray. In general, the relationship of the NSA test 
scores to the psychiatric diagnosis was about the same elsewhere as 
at these two stations; on the average, somewhat more discriminat- 
ing but not much more. 

Thus we have illustrated a central problem of prediction in social 
science—namely, the difficulties of making predictions when the 
behavior to be predicted (in this case, actually, the behavior of 
psychiatrists with respect to the individual tested) is itself unstand- 
ardized and subject to great variation. 

The second case study in prediction to be examined in detail in 
this volume also is one whose major difficulty lay in the differences 
between the test situation and the criterion situation. Here our 
problem was to predict what men would do after the war. It was 
of much practical importance to get this information as accurately 
as possible. Congress, in budgeting for the GI bill, needed a basis 
for some kind of estimate of the total cost of GI benefits for educa- 
tion. The Veterans Administration, Department of Agriculture, 
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and many other government agencies needed data for future plan- 
ning. Various branches of the War Department were involved in 
vocational guidance and in service educational programs, and were 
particularly anxious to find out two facts: the extent to which 
interest in various types of vocational careers was manifested ; and 
the extent to which this interest represented realism or mere wishful 
thinking. 

To ask a man fighting in the Pacific two years before the end of 
the war what he expected to do when the war was over—a war 
which might last many years as far as anyone knew—involved con- 
siderable temerity. The asking itself was simple, but the framing 
of questions and interpretation of the answers was another matter. 

In Volume II, Chapter 13, some of the findings of these studies 
have been reported. But the steps involved in arriving at sets of 
questions which it was hoped would be satisfactory represent a re- 
search saga worth recording for its methodological interest. Many 
of the men had only the vaguest of plans and some had alternative 
plans, of which one or more may have been quite unrealistic. 
Moreover, nobody knew what the labor market would be like after 
the war. Some economists were predicting a depression, with mil- 
lions unemployed. Whether or not soldiers were to be discharged 
into a country with factories running full blast or into a country 
with millions of people idle obviously was a contingency which 
could not be ignored in making predictions. 

Chapters 15 and 16 trace the stages of the development of ques- 
tionnaires for the analysis of postwar plans and summarize the re- 
sults of two special studies, made at the close of the war, in which 
soldiers who had filled out Research Branch questionnaires were 
followed up several months after discharge. The follow-up studies 
provided a check on the kinds of errors in prediction made for each 
individual, where errors were made. 

In both the studies of psychiatric diagnosis and the studies of 
postwar plans will be found some interesting examples of the appli- 
cation of scalogram analysis described in earlier chapters of this 
volume. The psychiatric example is especially interesting in this 
respect, since, incidentally, it provides some unique data for social 
psychologists who are interested in theories of abnormal psychology. 
As a result of many conferences with psychiatrists, fifteen scales or 
quasi scales by the scalogram method were constructed to cover a 
wide variety of problems thought to be involved in psychiatric 
breakdowns in the Army. The areas represented by the fifteen 
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scales or quasi scales in rank order of their discriminating value, 
as between psychoneurotics and normals, are: 


1. Psychosomatic complaints 8. Participation in sports 

2. Personal adjustment 9. Worrying 

3. Childhood symptoms 10. Childhood fighting behavior 
4. Soldier role 11. Relations with parents 

5. Childhood fears 12. Identification with war 

6. Sociability 13. Childhood school adjustment 
7. Oversensitivity 14. Mobility 


15. Emancipation from parents 


One of the most remarkable results was the fact that a single one of 
these fifteen areas, namely, a quasi scale on psychosomatic com- 
plaints, was almost as discriminating as between psychoneurotics 
and others as the entire battery of items covering fifteen areas. 
Because of this fact the final test forms developed could be reduced 
to only a few items. 

It may be noted in passing that attempts made by psychologists 
in the Research Branch to use modified projective techniques like 
the group Rorschach proved comparatively unrewarding.? This 
is neither a reflection on the theory of projective testing or on the 
practical utility of individual projective tests. It should be ob- 
served that the military situation was one in which there was no 
disadvantage to a malingerer in giving the ‘neurotic’? answer even 
if he suspected it to be such. In other situations, involving appli- 
cation for private employment, for example, the opposite might be 
true; and in such situations projective tests, in improved form one 
can hope, may be found to be indispensable. 

The postwar plans research is particularly valuable in serving as 
& warning against blind and indiscriminate use of scales. In some 
respects scales, of the kind described in the earlier chapters of this 
volume on scalogram analysis, were helpful in ordering respondents 
along single continua, such that each rank group had a higher prob- 
ability of carrying out the predicted behavior than the rank group 
immediately lower. However, there may be a temptation for 
future students to become so devoted to the application of scaling 
that they may fail in the vitally important task of analyzing fully 

* Actually, the Harrower-Erickson multiple choice Rorschach test proved to have no 
greater discriminating power than a single check-list item like, ‘““Are you ever troubled 
with nervousness?” The Research Branch experimented with a brief type of open-end 


response to the Rorschach slides, which was much more promising, but would have re- 
quired more time than was available to bring it to a stage of practical utility. 
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the situation in which they want to apply their scales. A careful 
study of Chapters 15 and 16 should, therefore, serve as an instruc- 
tive corrective—at least in the present early stages of the develop- 
ment of scale theory. 

The research reported here required decisions both by the re- 
spondent and by the analysts on the basis of weighing a good many 
values and contingencies. Items not fitting into a scale needed to 
be taken into account in assessing the probable materialization of 
contingencies and the implications of responses to questions getting 
at such contingencies. For example, a respondent may have defi- 
nitely and strongly wanted to go back to school, but felt that he 
might not go back if he were lucky enough to get a certain specified 
kind of job. Taking this response at its face value, the prediction 
problem involves estimating the chances of his actually getting such 
a job. It is quite clear that the latter estimation depends in turn 
upon a forecast of the employment situation in general, of the prob- 
able openings for such jobs in particular, and of the chance that the 
respondent would qualify for such a job if it were available. Only 
if we are prepared to make such guesses (which may involve proce- 
dures for which scaling has not as yet been adapted) can we assign 
such a man with confidence to a scale position with respect to his 
likelihood of returning to school. We may, it is true, fit him into 
a broad scale category comprising all those who would go if they 
cannot get the job they want. But we do this at a sacrifice to pre- 
dictability, since some men in such a category may have a high 
probability of getting that alternative job and some may have a 
low probability. 

Chapters 15 and 16 provide some concrete data on the advantages 
and limitations of various types of questions for eliciting needed 
information as to a man’s future intentions. 

For example, at one time it seemed wise to begin with a very 
general free-answer question like, “What do you expect to do after 
the war?” Such a question had the advantage of encouraging a 
free answer unstructured and unbiased by its format—the nondi- 
rective approach. But it was not particularly useful, because 
many men answered it in terms of marriage or in terms of where 
they expected to live or in terms of recreation rather than in terms 
of occupation. Somewhat more useful was a question of the fol- 
lowing type: “What kind of work do you think you will do right 
after the war? (Write down the name of the job and describe it as 
fully as you can.)’’ This brought responses in terms of job plans, 
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but was biased against the reporting of plans for further schooling. 
When schooling also was included in the format of the question, 
however, it was found to bias the answers too much in the direction 
of further schooling. Moreover, the problem of coding the free 
answers on thousands of questionnaires from all over the world 
made the cost of processing such free answers exceedingly heavy. 
The free-answer question, however, was frequently helpful in re- 
solving ambiguities in answers to more direct questions. 

The free-answer question was usually followed by a set of explicit 
check-list questions such as: 


Do you think you actually will go back to work for the same employer 
(company, person, etc.) you worked for before you came into the Army? 
Yes, I’m quite sure I will 
—____— I may, but I’m not sure 
No, I don’t think I will 
I worked for myself before I came into the Army 
I was not working before I came into the Army 


Sometimes, several such questions in a given area were asked in 
order to get a scale score. These questions were very useful, but 
they contained a large element of suggestion and there were a good 
many respondents who checked “Yes, I’m quite sure I will’ to 
apparently mutually incompatible questions. All answers, of 
course, could have been true if each represented a different period 
in future time, but, in many instances, as further querying showed, 
they often represented simultaneous alternatives or wishful thinking. 

To get at such alternatives, explicit contingency questions were 
introduced, e.g., “If you could get a good job, but not the kind of 
job you want, would you go back to full-time school?” It turned 
out that such contingency questions were not always answered in 
literal terms of the contingencies offered and they presented con- 
siderable difficulties in analysis, although they too were often neces- 
sary to ‘“‘peel off” unrealistic responses to the more direct questions. 

Perhaps the simplest device for classifying individuals who were 
considering alternative courses of action was a check list of alterna- 
tives such as the following: 


Check the one thing you probably will do first of all—after you have 
gotten your discharge and taken a vacation. 
I will probably work for the employer I was working for before 
I came into the Army 
——_—- I will probably work for salary, wages, or commission for some 
other employer 
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____— I will probably go to full-time school or college 
——_—— I will probably farm for myself or with my family 
____— I will probably go into business for myself 
——_—— I am undecided what I will do 


Such a question was easy to tabulate, and it could be used to spot 
respondents who were basically undecided without having to do an 
elaborate coding job. But it was not free from bias, for there was 
a tendency to check prestige items, like going into business, rather 
than an item closer to actual expectations. Hence, the analysis of 
answers to individual items, like “Do you think you will do farming 
when you get out of the Army?” followed by a series of further 
detailed questions to establish, by scale analysis or otherwise, the 
realism of an affirmative answer, was usually a more efficient pro- 
cedure for establishing predictive cutting points than the simple 
alternative check list. 

The data in Chapters 15 and 16 permit one to study these various 
types of approaches in terms of the degree to which they actually 
predicted or failed to predict subsequent behavior. It so happened 
that the employment situation in private industry was excellent 
after the war, although various factors, including shortages of mate- 
rials, inhibited the opening up of new small businesses. In spite 
of the good labor market, the incentives to continue education were 
high, as Congress successively increased the allowances in the GI 
bill. On the whole the predictions were quite successful—especially 
with respect to working for an employer in private industry, with 
respect to farming, and with respect to going back to school ; some- 
what less so with respect to setting up one’s own business or getting 
government jobs. Some of the reasons for this differential effec- 
tiveness are brought out clearly in the analysis. 

In the chapters which follow, primary emphasis has been placed 
on describing the methods used, and the content findings are treated 
as incidental. To describe the operations adequately, a full-sized 
book would be needed for each of these two case studies in predic- 
tion. The condensation required by limitations of the present 
volume has been achieved at some cost to the student, who will 
have to follow the description rather patiently on some of the pages 
which are closely written. 

It should be emphasized that these chapters represent concrete 
illustrations of problems of prediction. Many more such bodies of 
illustrative materials need to be assembled before we shall be ready 
to enunciate rules or principles about the conditions under which 
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prediction is or is not most likely to be successful. Such an effort 
should not be crudely empirical but should flow from considera- 
tions arising out of maturing theories of learning. 

John Dollard, while a consultant in the Research Branch, made 
an incisive attempt to systematize some preliminary thinking about 
this problem.* Dollard visualized the problem as a relationship 
between three classes of situations, which he called the origin, test, 
and criterion situations. The origin situation is one in which the 
thought or opinion sentence is first hit upon and rehearsed. In 
the test situation the social surveyor intervenes with questions which 
tap a stream of thought sentences or opinion sentences presumably 
orienting behavior of the person surveyed. The criterion situation 
is the dilemma around which prediction is centered and in which 
the respondent must take whatever rewards and punishments 
ensue upon his action. 

Prediction is possible, as Dollard sees it, to the extent that there 
is some correspondence between these three situations. There is 
in all of our social conditioning from early childhood a strain to 
match verbal expression with overt behavior and also thoughts 
with overt behavior. Children are trained in rehearsing directions 
received from parents and policed to see that they follow these 
directions correctly. Even when a child is asked to solve an arith- 
metic problem or to spell a word, “he goes through a process of 
inner rehearsal which is a kind of prediction of what the explicit 
answer will be. If his predictions prove frequently to be invalid, 
the psychologist may lay on him the curse of Binet and state that 
he has a low IQ.” 

Dollard advanced, very modestly, some preliminary hypotheses 
as to the conditions which seem to favor matching the opinion in 
the test situation to action in the criterion situation. Prediction, 
he suggested, is favored ?f: 

1. In the test situation no relevant response element is disconnected 
from thought and speech. If some relevant elements are discon- 
nected (i.e., unconscious) these elements will remain at play in the 
criterion situation though not registered in the test situation. 
Example: a man is not aware of his jealous tendencies toward his 
wife because they have never been excited. He might think, upon 
~ *8ee John Dollard, “Under What Conditions Do Opinions Predict Behavior?” 
Public Opinion Quarterly, Vol. 12, No. 4 (Winter 1948-49), pp. 623-632. This paper, 
originally prepared while the author was serving in the Research Branch, was read be- 


fore a joint meeting of the Washington Statistical Society and the Washington chapter 
of the Institute of Mathematical Statistics, March 9, 1944. 
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questioning, that he would not be jealous should a rival appear, 
but might find himself painfully so in the event. 

2. If the sentences of the respondent cover all response elements rele- 
vant to the criterion situation. Persons with poor verbal skills may 
find it difficult to forecast their behavior, even when no automatic 
repression exists as in the example above. 

3. If the connection between sentence responses and overt action re- 
sponses has been strongly rewarded in previous learning. In some 
persons, who have not been rewarded for thinking first and then 
acting, a split occurs between thought and action. They are dream- 
ers who do not make their dreams come true, the “phantasy” types. 

4. If the test situation provides stimuli only to the verbal response 
relevant to the criterion dilemma. The test situation should not be 
corrupted by extraneous threats or rewards. For example, a fac- 
tory worker may deny that he intends to join a union out of fear 
that the interrogator represents the boss. 

5. If the origin situation corresponds closely to the criterion situation. 
A man can best predict what he will do in a future situation if he 
has been in about the same situation before and thus knows what 
it’s all about. In the polling-election situation prediction is highly 
favored. The question, “Whom will you vote for?” is about the 
same in the two situations. But, as in so much of the illustrative 
material in Chapters 15 and 16, the test situations and criterion 
situations may have no such simple correspondence. 

6. If no new origin dilemma intervenes between the test and criterion 
situation. A new experience arising between the two situations can 
upset the prediction. For example, a Southerner coming North 
for the first time might have the opinion that he would never sit by 
a Negro on a bus; but once in the North a new learning dilemma 
occurs, since the Southerner does not want to appear strange or 
bigoted and his Southern penchants against breaking caste etiquette 
have been withdrawn for the time being. A new origin experience 
has come into being. 

7. If the test question explicitly presenis the conflict, 1.e., antic- 
ipations of rewards and punishments, of the criterion dilemma. A 
man will better predict what he will do in a future dilemma if he is 
told exactly what this dilemma will be. This condition cautions 
particular care in evoking the criterion situation vividly and spec- 
ifying exactly the behavior to be predicted. 

More efforts of this kind, to systematize thinking about the con- 
ditions of successful predictability, are needed. It is hoped that 
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the experience recorded in chapters following will serve, along with 
other such documents, as source material for the development of 
theories of prediction which, in Dollard’s language, will help “relate 
the data of the social survey to the framework of a more general 
science of human behavior.” 


CHAPTER 13 


THE SCREENING OF PSYCHONEUROTICS 
IN THE ARMY: TECHNICAL 
DEVELOPMENT OF TESTS 


HIs chapter is in three parts. Part 1 traces the process by 

which the Neuropsychiatric Screening Adjunct, used at all 
induction stations in the last year of the war, was developed. Parts 
2 and 3 are, in a sense, appendixes. In Part 2 each item used in 
the initial study in the Army to discriminate between patients in 
psychoneurotic wards in station hospitals and a cross section of the 
Army is presented, with the percentages of psychoneurotics and 
“normals” endorsing each response category. In Part 3 the tech- 
nical procedure for weighting items is described in detail. 


PART 1 


THE LOGIC AND PROCEDURE LYING BEHIND THE 
NEUROPSYCHIATRIC SCREENING ADJUNCT 


The General Concept of Screening 


The theory of screening tests is not particularly new;? they are 
merely devices for cutting off a portion of a population in such a 
way that it contains a disproportionate number of some subpopu- 
lation to be detected. To put it in the concrete terms of psychiatric 
screening, we may, of course, assuming infallible psychiatric diag- 
nosis, discover every psychoneurotic in a given population by indi- 


1 By Shirley A. Star. The initial studies of psychoneurotics were begun by Robin M. 
Williams, Jr., and Louis Guttman in cooperation with Major John Appel, representative 
of the Neuropsychiatric Consultants’ Division of the Surgeon General’s Office, who had 
requested that the research be done. After the first pretest, Shirley A. Star succeeded 
Dr. Williams on the team. Dr. Guttman contributed the mathematical techniques 
employed and discussed here. The empirical work and later validation studies were 
under the direction of Miss Star. 

*See W. A. Hunt, C. L. Wittson, and H. I. Harris, “The Screen Test in Military 
Selection,” Psychological Review, Vol. 51, No. 1 (January 1944), pp. 37-46. 


486 


SCREENING OF PSYCHONEUROTICS 487 


vidual psychiatric evaluation of each member of that population. 
On the other hand, this is a long and costly procedure, and often 
the number of psychiatrists needed for it is not available. A psy- 
choneurotic screening test aims to economize psychiatrists’ time 
by reducing the number who must be individually examined with- 
out a proportional reduction in the number of psychoneurotics who 
are examined. 

From this standpoint, any stimulus to which there is a significant 
difference in response as between psychoneurotics and nonpsycho- 
neurotics may be used as the basis of a screening test.’ If the re- 
sponses are suitably scored, and the men to receive psychiatric 
examinations are selected by reference to their scores, then the 
group examined by psychiatrists will contain a higher proportion 
of psychoneurotics than the population as a whole. Where the 
psychiatric facilities permit an individual examination of only a 
portion of the population, then selection of that portion by means 
of a screening test brings greater returns than random selection. 
Just what proportion of the population is to be considered screened 
by the test and referred for psychiatric evaluation is, where a range 
of scores exists, a matter for empirical decision, depending, on the 
one hand, on how many men can be individually examined and, on 
the other, on what margin of error in failing to uncover psycho- 
neurotics is permissible. 

In general, screening tests have either striven for criterion-exclu- 
sive or criterion-inclusive screening. In the case of criterion-exclu- 
sive screening the goal is to select a group all of whom have the 
desired characteristic without necessarily including within that 
group all who have that trait. For example, tests of vocational 
ability have usually assumed that there would be more applicants 
than openings and the function of screening tests was to select a group 
equal in size to the available jobs, all of whom would succeed at the 
job, without too much concern that other possibly successful men 
were excluded. On the other hand, criterion-inclusive screening 
strives to include within the selected group all persons who have 
the desired characteristic even though not all persons in the group 


* Though paper and pencil testa, of which our test is one, are perhaps most often em- 
ployed, such sensory-motor measurements as body sway and dark vision have also been 
successfully used. See, for example, H. J. Eysenck, “A Comparative Study of Four 
Screening Tests for Neurotics,”’ Psychological Bulletin, Vol. 42, No. 9 (November 1945), 
pp. 659-662. 
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will have that characteristic.‘ To present this difference graphi- 
cally: 


Screened group - criterion - 
exclusive screening 


Screened group - criterion - 


inctusive screening Total population 


Psychiatric screen tests have usually attempted inclusive screen- 
ing; it has generally not been considered useful to be able to screen 
out a group all of whom were psychoneurotic; rather, some reason- 
able assurance has been wanted that all psychoneurotics would be 
screened from the population. From the standpoint of the prac- 
tical problem with which the Research Branch was confronted— 
viz., to develop a psychiatric screening test to be used in induction 
station examination of men for the Armed Forces—this was cer- 
tainly the case. The question there was not to select out a group 
of psychoneurotics, nor was it simply to decide how much time 
could be given to psychiatric evaluations of individuals and then to 
select the number of men who could be processed in this time in such 
a way as to maximize the likelihood of their being psychoneurotic. 
Since their goal was the detection and elimination of every disquali- 
fied psychoneurotic, psychiatric examiners at induction stations saw 
every man, reducing the amount of time given to each man rather 
than reducing the number of men seen. In order for a screening test 
to be useful in a situation like this, it had to aim at screening at least 
as many of the psychoneurotics as the brief individual examinations 
uncovered while reducing the number of men to be seen so that 
their examinations might be more thorough in the face of an acute 
shortage of psychiatric personnel. 

Inclusive screening of this kind is adaptable only to situations in 
which (1) there are ways of evaluating the screened group in order 
to separate the men who have the screening criterion trait from the 
men who were falsely screened in the effort to make the screened 
group inclusive and (2) there can either be no cheating or where 
~ #Inclusive and exclusive screening, of course, represent only the two ideal poles of a 


continuum. With a fine enough distribution of scores, actual screening procedures can 
be set at almost any point between. 
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cheating will simply place the malingerer in the group to be sub- 
jected to further evaluation. Psychiatric screening in the Army 
sought to meet these conditions: the group screened were not to be 
thereby rejected but simply referred for individual diagnosis, and 
in so far as there was a problem of malingering, it was primarily 
that of men trying to avoid service by claiming psychoneurotic 
symptoms rather than that of men trying to hide their psychiatric 
disturbances in order to enter the Armed Forces.® 

As long as screening is regarded simply as a practical tool, the 
question of “false negatives’”—men who are screened without pos- 
sessing the desired trait—can be regarded thus summarily. A 
psychiatric screening test to be useful does not need to essay an 
individual prediction; except in the formal sense that it precedes 
the psychiatrist’s evaluation and predicts his behavior,® detection 
rather than prediction is involved. If we wish to predict or to 
make individual diagnoses (which means, in this case, to contem- 
plate the possibility of tests capable of replacing psychiatrists), 
then, of course, false negatives become as important as false posi- 
tives (ie., psychoneurotics who are not screened by the test) in 
indicating the sources of imperfection in the screening instrument. 


Design of the Initial Study 


The design of the study from which was derived the screening test, 
later named the Neuropsychiatric Screening Adjunct and officially 
adopted for use at all induction stations, was a comparison of 
responses of a cross section of the Army with those of a sample of 
psychoneurotic patients in Army hospitals to a questionnaire cov- 
ering fifteen areas which the literature and psychiatric advice sug- 
gested would prove to be discriminating. This design had certain 
weaknesses, among which is what came to be the major question 
of whether the test could be transferred to the induction station 
situation.” The discussion will, however, be easier to follow if the 
critique of the design is reserved until the reader has a clear idea of 


5 Homosexuals were a possible exception to this generalization. 

* And note that even for his behavior the prediction is only that he will find a higher 
proportion of the screened group to be psychiatrically disqualified than of the un- 
screened group, which implies that he could find the majority of the screened group 
acceptable without damaging the prediction. 

7The study was not originally planned to yield such a screening test, but rather was 
later adapted to this purpose, a historical circumstance which serves to explain why the 
design was not more effectively molded to the problem. Before application at induction 
stations, the NSA was, however, tried out on samples of men passing through induction 
stations, as we shall see, in addition to the Army population. 
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the criterion situation in which the test was evolved and of the test 
itself. 

The major psychiatric concern of the Army was not to exclude 
from service every man who was judged by psychiatrists to be 
psychoneurotic. Rather, psychiatric examiners were asked, in 
effect, to make a prediction and exclude from service only those 
psychoneurotics who they felt would prove to be incapable of ren- 
dering effective military service in any capacity. While the induc- 
tion into and discharge from the Army of psychoneurotics in prac- 
tice varied with the availability of man power, nevertheless, in 
policy, at least, psychoneurosis was never considered to be grounds 
for rejection or discharge unless it rendered a man incapable of 
performing ‘‘a reasonable day’s work for the Army” in some assign- 
ment. It was for this reason that the group of psychoneurotics 
currently ineffective, i.e., patients in hospitals, were chosen as the 
criterion group rather than using psychiatrists’ diagnoses of men 
on duty to sort them into “normal’’ and psychoneurotic subgroups.® 

This criterion of ineffectiveness left some unresolved problems, of 
course. At any given moment, men in the Army could be perfectly 
classified as effective or ineffective: either they were present for 
their regular duties or they were not. But it is equally obvious 
that at any given moment, the effective group contained some men 
who would later prove to be ineffective, while some of the ineffective 
group would return to duty. Nevertheless it was felt that a psy- 
choneurotic breakdown under conditions of domestic Army service 
could at the very least be regarded as a bad prognosis for future 
effectiveness under possibly more trying conditions so that this 
group did not depart too far from the ideal concept of “‘ineffective- 
ness.” And the cross section of the Army, though it contained 
some men who would prove to be psychiatric ineffectives, could be 
regarded as containing them in such negligible proportions as to 
approximate the polar concept of effectiveness.'® 


*See War Department Circular No. 164 (April 1944), which further provided that 
men with “pronounced psychiatric disorders” should not be sent overseas, but that men 
with other psychiatric disorders might be sent overseas and, where the disorder was 
“mild peychoneurosis, transient in character,” could be assigned to combat duty. 

* Ultimately, of course, a man’s being a psychiatric patient in an Army hospital was 
also dependent on a psychiatric diagnosis. But as the Army operated, such men were 
not sought out, diagnosed, and hospitalized. Rather it would only be after exhibitions 
of gross inefficiency, constant going on sick call, etc., that these men would be culled out. 
They would, therefore, represent a more clear-cut group of ineffectives than a group 
selected by psychiatrists from among men on duty as likely to break down. 

1 Just how much and what kind of service a man should render to be considered an 
“effective” is itself not precisely defined. Clearly a man should complete training or 
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Once the criterion group was defined as those men who proved 
incapable of serving usefully in the Army because of psychoneurosis, 
there remained the problem of sampling the two universes. The 
cross section of the Army, to be regarded as the opposite pole, was 
selected in accordance with the sampling procedures usually em- 
ployed by the Research Branch, but the psychoneurotic sample 
was more difficult. At the time of this study (January-February 
1944) men who had been hospitalized for psychoneurosis in the 
Army could be found in six places: back at duty, with or without 
supplementary outpatient treatment; in special rehabilitation 
centers, where men received modified training and outpatient treat- 
ment; in station, regional, or general Army hospitals; or back in 
civilian life. However, when psychoneurotics came to the attention 
of medical officers in the United States and hospitalization was 
deemed necessary, these men were referred, as a first step, to a sta- 
tion hospital, that is, a hospital serving only one post or camp. 
After treatment and/or diagnosis here, cases were disposed of in one 
of the four ways suggested above; i.e., men were sent back to duty, 
or assigned to special centers, or referred to other hospitals, or 
discharged from the Army. Obviously, then, sampling at the level 
of station hospitals, which were the clearinghouse for all Army 
psychoneurotics who were hospitalized, would yield the most unse- 
lected population of hospitalized psychoneurotics from the stand- 
point of future disposition. While the use of a more selected sample 
—say, psychoneurotics to be discharged—might have more closely 
approximated the criterion of ineffectiveness, the station hospital 
population was nevertheless chosen for several reasons. 

In the first place, to use a group whose final disposition was known 
would have introduced another variable—viz., the possibility that 
their responses were influenced by their knowledge that they were 
to get out of the Army if that group were chosen, or their presump- 
tion that this would be the outcome if men in regional or general 
hospitals were selected. It seemed unwise to introduce a factor 


he has not been able to perform a “reasonable day’s work” for the Army. But if he 
completes training and performs a year’s work at some job before he has a psychiatric 
breakdown is he to be regarded as a potential ineffective who should have been excluded 
from the Army in the first place or did the year’s service which he was able to render 
justify induction? At the other extreme, men who break down in combat cannot be 
regarded as men who should have been excluded from the Army in the first place, for 
we know enough about combat breakdowns to say that even among those who could 
not return to duty, many could have served the Army effectively in some other capacity, 
and many could even have gone through combat successfully if they had been subjected 
to shorter stretches of it. 
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like this. In addition, however, it is a necessary theoretical assump- 
tion in work of this kind that the psychoneurotic ineffectives be 
regarded as having come from among the men in the other universe, 
the “effectives.” By sampling station hospitals and by selecting 
for inclusion in the sample the sixteen station hospitals which were 
located in and received their patients from the camps sampled to 
constitute the cross section of the Army, this assumption was given 
some basis in fact. The sample of hospitalized psychoneurotics 
may, then, be regarded as a representative sample of all the types 
of psychoneurotics hospitalized in the Army before overseas service, 
and as a group which came from the same universe as the cross sec- 
tion represented. 


The Original Battery and Its Screening Results 


The questionnaire basic to this study contained over a hundred 
personality, attitude, and background items which were grouped 
into fifteen areas. These fifteen areas had been selected in con- 
sultation with psychiatrists and psychiatric literature as those, on 
the one hand, most relevant to the psychiatric diagnosis of psycho- 
neurosis and, on the other, likely to be susceptible to investigation 
through the nonintensive technique of a self-administered, anony- 
mous questionnaire. These fifteen areas are enumerated below, 
with a brief description of the types of questions contained in them,’ 
stated in terms of what was assumed on a priori grounds to represent 
good (nonpsychoneurotic) adjustment : 


Aspects of Personality Development 


1. Childhood relations with parents: Indexes the extent to which men felt 
that their parents had been lenient and fair and that other children in 
the family had not been preferentially treated. 

2. Childhood fears: Indexes the relative absence of such childhood fears 
as fears of being left alone, punishment, being laughed at, etc. 

3. Childhood neurotic symptoms: Indexes the relative absence of the 
traits of stuttering, stammering, nail biting, and enuresis in child- 
hood. 

4. School adjustment: Good adjustment is defined as having liked school 
and having achieved a reasonable degree of success there. 

5. Childhood fighting behavior: Good adjustment here implies some 
degree of aggressiveness in boyhood as indicated by having gotten 
into, held one’s own in, and had no fear of fights. 

6. Participation in sports: Good adjustment is defined as having taken 
part in such sports as baseball, basketball, football, golf, tennis, fish- 
ing, and hunting. 


4 Part 2 of this chapter shows the exact questions used and the distribution of an- 
sewers given by the cross section and by the psychoneurotic sample. 


SCREENING OF PSYCHONEUROTICS 498 


7. Emancipation from parenis: Good adjustment here had two main 
components—the actual physical emancipation involved in living 
away from home upon occasion and the psychic emancipation im- 
plied by establishment of satisfactory heterosexual relationships. 

8. Mobility: Well-adjusted men were expected to exhibit stability as 
indexed by their employment record, regularity of church attend- 
ance, and little geographical moving around. 


Description of Present Personality 


9. Sociability: Indexes a liking for and association with other people. 

10. Worrying: Good adjustment implied a lack of worry about one’s self 
and the future. 

11. Oversensitivity: Good adjustment is defined as an absence of such 
traits as irritability, quickness to take offense, resentment of criti- 
cism, etc. 

12. Personal adjustment: The well-adjusted man was thought to exhibit 
self-confidence and a lack of self-pity, depression, and anxiety. 

13. Psychosomatic complaints: Indexes an absence of vague physical 
complaints like “‘nerves,’”’ stomach trouble, insomnia, etc. 

War Motivation 


14, Identification with the war effort: Measures the extent to which men 
felt that the war was worth fighting and that they had a personal 
stake in it. 

15. Acceptance of soldier role: A willingness to be a soldier as judged by 
acceptance of fairness of being drafted, preference for job of soldier 
as against that of war worker, and a willingness to go overseas. 


These fifteen areas had each been designed to yield a scale in the 
scalogram sense discussed in Chapters 1 through 9. The reasons 
for this design were obvious; if these areas could be shown to scale, 
then each of them could be represented by a single numerical vari- 
able, and moreover, the simple correlation of these scale scores with 
any outside variable would be precisely the multiple correlation of 
that variable with all of the questions entering into the scale.!? 
The importance of these properties in simplifying computations is 
readily apparent, since, if it is desired to discriminate between two 
groups like normals and psychoneurotics on the basis of items which 
form a scale, the scores obtained by scale analysis give the best way 
to combine the items to get the maximum differentiation possible 
from the area as a whole. Accordingly, each of the areas was pre- 
tested, and all but four were found to forin scales. These four— 
childhood fears, childhood neurotic symptoms, participation in 


See Chapter 5 for discussion of this point. The important and definitional prop- 
erty of such a scale—that from an individual’s scale score can be reproduced his re- 
sponses to all of the items in the scale—is not directly relevant here, though, of course, 
it is just because a scale approximates this condition that an outside variable correlates 
in the fashion described in the text. 
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sports, and psychosomatic complaints—did, however, form quasi 
scales, which lack the property of scales of reproducing the behavior 
of an individual on all the questions from a single numerical vari- 
able, but do possess to a large extent the property of reproducing in 
its zero-order correlation with an outside variable the multiple 
correlation to be obtained from all the items. 

Therefore, each of the fifteen areas was scored with simple weights. 
For convenience, each of the questions was dichotomized and scored 
as zero or unity, unity being assigned to the responses presumed to 
indicate good adjustment. For some questions, there were found 
to be no differences between normals and psychoneurotics, but in 
no case did the scale or quasi scale scoring of an item prove to be 
the reverse of the empirically found differences on that item. This 
outcome indicated, first, that our a priori judgment had been cor- 
rect and, second, that the scale scoring, even on a dichotomous 
basis, preserved the predictive efficiency of the individual items. 
Since these item dichotomizations and the weights assigned them 
had been determined on the basis of the pretest made to determine 
the existence of scales, rather than the main study, the possibility 
that the scoring was capitalizing on chance error, with the conse- 
quence that less discriminating results would be obtained in another 
trial, was largely eliminated." 

As shown in Table 1, within each of these fifteen aspects of per- 
sonality, background, and adjustment, differences existed between 
the cross section and the psychoneurotic patients deriving from it. 
However, the procedure followed here is that of dichotomizing the 
score distribution of each variable into those scores occurring rela- 
tively more frequently among the psychoneurotics than among the 
Army cross section and those occurring relatively less frequently.“ 
Since the point of dichotomization was determined by the score 
distributions in the sample under study, it is to be noted that the 
tendency would be to overestimate somewhat differences between 
psychoneurotics and nonpsychoneurotics. For this reason, we have 
employed here the more conservative .99 level of sigmficance, and 
differences which were not found to be significant at this level are 
so indicated in the table. 

It is clear from these data that psychosomatic complaining was 

3 It is noteworthy, however, that the scoring would have been exactly the same if 
the main sample had been used to determine it, an outcome which suggests the stability 
of scale analysis. 


4 In Chapter 9, Volume II, we have introduced the concept of critical and noncritical 
scores to refer to this distinction. 


SCREENING OF PSYCHONEUROTICS 496 


the area which most sharply differentiated between psychoneurotics 
and nonpsychoneurotics, but there remained the question of whether 
a combination of all the items in the battery could improve on these 
results or not. Accordingly, the product-moment correlations be- 
tween the fifteen scores were computed and their multiple correla- 
tion with the criterion was examined.’ From what we have said 


TABLE 1 


SUMMARY OF COMPARISONS BETWEEN Cross SECTION AND PsYCHONEUROTIC 
Patients IN Firrgzn Angas Scormp DicuaoromMovusL_y 


(January-February 1944, Enlisted Men with No Overseas Service) 


PERCENTAGE WITH RELATIVELY 
LOW SCORES:* 


Area Cross Neurotic 
Section patients Difference 
Psychosomatic complaints 29 89 60 
Personal adjustment 30 67 37 
Childhood neurotic symptoms 20 53 33 
Childhood fears 32 62 30 
Sociability 16 45 29 
Acceptance of soldier role 31 59 28 
Oversensitivity 19 46 27 
Worrying 25 49 24 
Childhood participation in sports 40 64 24 
Childhood fighting behavior 30 45 15 
Childhood relations with parents 41 53 12 
Identification with the war effort 24 31 7 
Mobility 49 55 6** 
Emancipation from parents 36 42 6** 
School adjustment 51 56 5** 
Number of cases 8,501 563 


* In this table a “low” score signifies relative lack of adjustment, as this bas been previously defined, in 
each area. In the summary scores presented in Part 2 of this chapter, these scores are shown below the 
dividing line. The percentages given here are essentially the proportions obtained when the score distri- 
butions are dichotomized at the point which maximises the differance between the cross section and the 
psychoneurotics. 

*™ Not significant at the three sigma level. All differences are significant at the two sigma level. 


before, it is apparent that the multiple correlation of the criterion 
with these fifteen scores closely approximates the multiple correla- 
tion of the criterion on all the items in all of the fifteen areas. 

The actual correlations among the fifteen variables are shown 
for the cross section and the psychoneurotic patients separately in 

Ordinarily, to predict a qualitative criterion, the discriminant function would be 
used, but in the special case of a dichotomous criterion the discriminant function and 
multiple regression are the same. Using zero and unity as numerical values for the 


categories of the criterion and computing product-moment correlations therefrom yields 
the same results as does the linear discriminant function. 


TABLE 2 


INTERCORRELATIONS AMONG FIFTEEN PERSONALITY AND ADJUSTMENT VARIABLES 
Warts Enuistep MEN with No Oversgas Service: Cross Secrion aND Hosrrraizzep PsycHONSUROTICS CoMPARED* 


(January-February 1944) 


Child~ Child- 
Psycho- Child- _— Per- Parti- Identi- hood hood Rela- Emanci- 
somatic hood sonal Over- Child- cipa- fication fight- echool tions pation 
com- symp- adjust- sensi- hood Soldier Worry- Socia- tion in with ingbe- adjust- with from Mobil- 
plainte toms ment tivity Sears role ing bility sports war havior ment parents parents ity 
Psychosomatic 41 50 37 32 46 43 19 16 23 18 07 00 01 —.03 
complaints 44 45 39 35 -28 38 -28 .06 09 05 06 16 05 02 
Childhood 41 25 -26 24 .20 24 16 -16 16 15 ll 10 10 .00 
symptoms 44 37 .28 383 .30 25 81 21 13 17 .04 AS 10 07 
Personal -50 +25 43 26 49 41 17 12 32 08 15 10 05 —.03 
adjustment 45 .37 43 28 40 44 30 12 25 .O7 .20 11 4 05 
Oversensitivity 37 -26 43 24 22 34 .22 03 25 -.10 7 07 -02 05 
39 28 43 -29 17 38 38 06 -22 —.16 22 21 04 13 
Childhood fears 32 24 .26 24 18 36 15 08 .09 18 05 .02 10 05 
35 .33 28 29 19 43 31 17 05 16 01 12 15 00 
Soldier role 46 -20 49 22 A8 35 15 16 85 21 05 07 06 —.04 
28 30 40 17 19 25 18 25 -24 22 .09 02 14 —~ .05 
Worrying 43 24 Al 34 .36 35 15 08 23 19 04 02 10 —.09 
88 25 44 38 43 +25 25 07 15 12 06 At 15 00 
Sociability 19 16 17 22 15 1S AS 18 15 ll -09 07 04 -09 
.28 31 -30 38 31 18 25 21 23 lu 10 17 09 13 
Participation 16 -16 12 .03 08 16 .08 18 -08 -23 -08 05 A2 01 
in sports 06 21 12 06 17 -25 .07 21 13 -29 10 -05 13 05 
Identification 28 16 32 25 .09 35 -23 15 .08 09 -06 06 08 02 
with war 09 18 25 «22 05 24 15 23 13 .07 .07 10 05 10 
Childhood fighting 18 15 08 —.10 18 21 19 AL 23 09 08 —.03 15 —.10 
behavior 05 17 .07 —.16 16 22 12 pet -20 07 —.13 -.11 23 11 
Childhood school 07 ll 15 17 .05 05 04 09 08 -06 08 -.01 01 11 
adjustment -06 04 -20 22 01 09 06 10 10 07 —.13 04 10 09 
Relations with 00 10 10 O07 02 .O7 02 .07 -05 06 —.03 -.01 .00 .03 
parents 16 15 All 21 12 .02 1 17 05 10 -.11 .04 —.04 06 
Emancipation 01 10 05 02 10 06 10 04 12 08 15 01 00 —.28 
from parents -05 -10 14 04 15 14 15 08 313 -05 23 10 —.04 —.27 
Mobility —.03 00 —.03 -05 05 —.04 —.09 09 01 02 —.10 Al 03 —.28 
.02 07 05 13 .00 —.05 .00 13 05 10 -—.11 08 .06 —.27 


® In each cell, the coefficient in the upper left-hand corner represents the correlation in the Army cross section; the coefficient in the lower right-hand corner, that 
in the psychoneurotic group. 
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Table 2. Before the intercorrelations could be computed for the 
multiple regression, however, it was necessary to determine how to 
weight the cross section and the psychoneurotics in the computa- 
tions. The proper equations were derived and showed that the 
weights should be such that the two groups in the criterion had 
equal variances on the prediction scores. Since the two sets of 
intercorrelations shown in Table 2 were so similar and since the 
variances of the scores did not differ much between the two groups, 
it did not appear worth while to solve the cumbersome equations 
necessary to get the exact weights, for weighting the two groups 
equally would approximate the equating of their variances on the 
prediction scores. Accordingly, the intercorrelations were recom- 
puted for the combined population of psychoneurotics and non- 
psychoneurotics, regarding each of these groups as making up one 


TABLE 3 


INTERCORRELATION AMONG Firrnen PERSONALITY AND ADJUSTMENT VARIABLES 
AND CRITERION 
Wars Enustep Men wits No Overseas SERVICE: 
Cross Sxcrion aNp HosprrraLizep PsycHONEUROTICS WEIGHTED EQuaLLty 
AND CoMBINED 


(January-February 1944) 


j 3 
‘ as 3 
; ; . 25 i a § Ss me 3 ds 
2 i 3 t 29 283, 3 # 2 ae 
So ag Ss & i Bogs: $ 3 £ 33 $8 z 
erseig g2 62 £ Fs8ke Fy 2) Gb ge 
Saessens 6S Ff BESKESE SF SY SF ae & 
Criterion -66 .38 42 .33 83 .35 8 .27 .33 .28 .12 18 11) «6120 08) 08 
Psychosomatic 
complaints 66 55 .60 .48 .46 50 47 .38 .27 .19 20 12 18 07 06 
Childhood 
symptoms 38 .55 40 386.39 .36 6.32 34.27.17 .22) 12) .1606«.12 07 
Personal 
adjustment 42 .60 .40 AQ 37 .53)) «47 383 .22 300 £14.19) .14i«1stsis«Ck 
Oversensitivity .33 48 .36 .49 36 © ©.28 ~=«.41 89 «138.25 —.07 .22) 18 «83.0512 
Childhood 
fears -33 .46 .39 .37 = .36 28 6.46 35 .21.12 .22) 07 12) 116 = =.02 
Soldier 
role .35 .60 .36 .63 .28 .28 .36 .25 .28 .31 0.26.10 .0—«wsd:2:s« —-.01 
Worrying -27 47 .32 47 41 46 .36 -27 14.20 «£19 «=©=.08) = 610s «13 —.01 
Sociability -33 .38 .34 .330 4.39.35 00.25.27 27.210 (152K CiOCi«“dR 
Participation 
in sports .28 .27 .27 .220 13.21) «2814 27 12 320 .12 089 14 86.05 
Identification 
with war «12 .19 .17 .30 0©=«.25 12) 31.20.21 12 09 07) .10 06 .07 
Childhood 
fighting behavior -18 .20 .22 .14 —.07 .22 .26 8.19 .15 .30 .09 —.08 —.05 .19 —.09 
Childhood school 
adjustment Al 12.12.19 .22 07 .10 .08 .12 .12 .07 —.08 03 .06 .10 
Relations 
with parents 12.18.16 .14 .18 .12 .00 .10 .15 .08 .10 —.05 .03 —.01 .06 
Emancipation 
from parents .08 .07 .12 .11 § .05 .16 11.13 .06 .14 06) =.19 3.06 —.01 — .28 


Mobility -08 .06 .07 04 .12 .02 —.01 —.01 .12 05 .07 -.09 10 .06 —.28 
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half of the combined sample.'* These intercorrelations and the 
correlation of each variable with the criterion are shown in Table 3. 

Rather than use the Doolittle or some similar method for com- 
puting the multiple regression, factor analysis was employed. 
Computing the regression exactly from Table 3 would have tended 
to capitalize on sampling error and would have been a somewhat 


TABLE 4 


Loapines or Common Factor, Muutiete Recression WEIGHTS 
AND RE.LaTIVE ImporTANCE IN ToraL ScorB oF Firreen 
Txsts In a PsycHongevurotic Batrery 


(January-February 1944) 


MULTIPLE REGRESSION WEIGHTS: 


Number of 
score points 
Test Communality Raw = Corrected* posstble** 

Psychosomatic complaints 61 34.5 8 120 
Childhood symptoms Al 13.0 12 48 
Personal adjustment 51 17.4 10 60 
Oversensitivity 36 10.4 4 40 
Childhood fears 34 10.1 3 57 
Soldier role 38 11.3 9 36 
Worrying 33 8.2 7 28 
Sociability .30 9.5 10 30 
Participation in sports .18 6.9 5 30 
Identification with war .13 2.8 2 10 
Childhood fighting behavior 08 4.0 4 12 
Childhood school adjustment 05 2.3 2 6 
Relations with parents 05 2.6 2 12 
Emancipation from parents 03 1.7 1 6 
Mobility 01 1.8 2 8 


* The raw weights have been reduced to scale by dividing each by the standard deviation of seores on 
the corresponding test. 


* These values are simply the corrected weight times the maximum unweighted score possible on the 
original test (i.e., the number of items in the test). 


laborious procedure as well. Instead, Thurstone’s centroid method 
was used to extract one common factor, and it was decided that the 
residuals were too small to go any further. The approximate re- 
gression weights for predicting the criterion from the fifteen areas 
were then computed from the factor loadings by Guttman’s formu- 

16 Tt might be pointed out that these correlations were computed only to maximize the 
multiple correlation with the criterion, that is, to maximize the discrimination between 
the cross section and the psychoneurotics. The determination of the proper weights 


to assign these groups was a mechanical problem which did not depend on or make any 
assumptions about the proportion of psychoneurotics actually in the population. 
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las..7 The factor loadings and the weights derived from them 
were, as may be seen in Table 4, such as to suggest that the single 
area of psychosomatic complaints would discriminate about as well 
as the entire combination of fifteen areas. Random subsamples of 
the cross section and psychoneurotic sample were selected and 
scored according to these regression weights, and the result con- 
firmed the earlier impression: when these subsamples were assigned 
weighted scores based on all fifteen areas, 27 per cent of the non- 
psychoneurotic group and 87 per cent of the psychoneurotic group 
received critical scores, while the simple dichotomous scale scoring 
of the psychosomatic complaints area alone resulted in 29 per cent 
of the nonpsychoneurotics and 93 per cent of the psychoneurotics 
in these subsamples receiving critical scores. As a further check, 
the multiple regression weights of the criterion on the first eight 
variables in Table 3 were computed by the Doolittle method and 
the subsamples rescored, with exactly the same results as the use of 
the fifteen-area weighted scores had yielded. Clearly, there was 
little to be gained from preserving the other fourteen areas, and our 
attention was thereafter directed to attempting to improve the 
discrimination obtained from the psychosomatic complaints area. 


Further Refinement of the Psychosomatic Complaints Score 


Since the fifteen questions used in this score were known to form 
@ quasi scale, and the criterion also fitted into this pattern, it was 
to be expected that different methods of weighting would not yield 
essentially different results. However, even a small improvement 
in discrimination seemed worth striving for, since a 5 per cent re- 
duction in the number of nonpsychoneurotics receiving critical 
scores or a 5 per cent increase in the number of psychoneurotics 
screened would be of practical usefulness. The experiments with 
weights yielded, as we shall see, essentially negative results, although 
they bore out in striking fashion the stability of the quasi-scale 
pattern. 

The results to be discussed here represent the discrimination 
obtained when each of three types of weights was employed: 

1. Simple dichotomous (0, 1) weights. These were weights deriv- 


See Louis Guttman, “Multiple Rectilinear Prediction and the Resolution into 
Components,” Psychometrika, Vol. 5, No. 2 (June 1940), pp. 75-99; and Louis Guttman 
and Jozef Cohen, “Multiple Rectilinear Prediction and the Resolution into Com- 
ponents: u,” Psychometrika, Vol. 8, No. 3 (September 1943), pp. 169-183. 
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ing from the quasi-scale pattern of the items and consisted essen- 
tially of giving a weight of zero to those answer categories of a ques- 
tion which were checked by a significantly larger proportion of 
psychoneurotics than that in the cross section, and a weight of one 
to all other responses. 

2. Simple trichotomous (0, 1, 2) weights. Fourteen of the fifteen 
items used were stated in terms of a three-point check list, while 
the fifteenth had four. The trichotomous weights assigned a 
weight of zero to the response category which occurred most dis- 
proportionately among the psychoneurotics, a weight of two to the 
response category at the other extreme and a weight of one to the 
intermediate response. An arbitrary grouping of two answer cate- 
gories was made in the case of the fifteenth question in order to 
score it trichotomously. 

3. Differential trichotomous weights. These were the best weights, 
in the sense of least squares, for discriminating between the cross 
section and the psychoneurotic group and were derived from the 
multiple regression of the criterion on the fifteen items. 

Use of the simple (0, 1) weights in the area of psychosomatic com- 
plaints yielded, as we have seen before, a sharp differentiation be- 
tween the cross section and the sample of psychoneurotic patients. 
In fact, only 28.6 per cent of the cross section received scores of 9 or 
less as compared with 89.4 per cent of the sample of psychoneurotics. 
These data are shown in the top diagram of Chart I. The increased 
range of scores made possible by the use of (0, 1, 2) weights resulted 
in a slight improvement over these results, as seen in the center dia- 
gram of Chart I. Nor were the results obtained by using weights 
derived from the multiple regression of the criterion on the fifteen 
items substantially different from the discrimination achieved 
through use of simple weights. (See the bottom diagram of Chart I.) 
In fact, in the very course of deriving these weights, it was proved 
mathematically that, for this set of data, the use of differential 
weights would not yield much improvement. This outcome was 
such a dramatic empirical demonstration of the scale property pre- 
viously referred to—viz., that the zero-order correlation of simple 
scale scores with an outside variable is the multiple correlation of 
that variable with all of the items in the scale—that a presentation 
of the derivation of these weights is worth while. This appears as 
Part 3 of this chapter. 


CHART I 


DisrRisvTIoN or Psycrosomatic Compiaints ScorEs* 
(January-February 1944) 


Using simple dichotomous weights 
Cross section 


Shaded orea of each distribution represents 


proportion receiving “false” scores. 
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Using differential trichotomous weights 


PROPORTION RECEIVING 
EACH SCORE 
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* The position of the arrow indicates the cutting point employed to separate critical 
scores (on the left of the arrow) from noncritical scores. The score with which the 
arrow coincides is considered a critical score. Thus, in the top diagram, 0 through 9 
are critical scores, 10 through 15 are noncritical. 
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Problems in the Application of the Test to 
Induction Station Screening 


When it was decided to adapt the psychosomatic complaints 
score to induction station screening, certain questions affecting its 
transferability and usefulness arose. First of all, there was the 
easily answered question of the reliability of test scores. Then, 
there was a series of minor points growing out of differences in the 
administration of the test which would have to be introduced. The 
test items originally had been asked as part of a larger questionnaire 
through which they were scattered; did surrounding them with 
other questions affect the way they were responded to or would 
they be answered similarly when asked with no context? More- 
over, the Army study employed an anonymous questionnaire; 
would the necessary requirement that induction station tests be 
signed alter men’s responses? 

Even if these points could be settled and dismissed there were 
some more serious issues to be dealt with. The test had been de- 
veloped to distinguish only psychoneurotics from other men, since 
other categories of psychiatric disability were relatively insignificant 
among the men already in the Army. But, in dealing with the 
unselected population reporting to induction stations to be exam- 
ined, psychiatrists examined men with reference to all types of 
psychiatric disorders. A screening test which detected psychoneu- 
rotics but permitted psychopaths and psychotics to go by unde- 
tected would be of little use, for then all men would have to be 
examined for these latter tendencies anyway. The question then 
was whether the test could be extended to psychiatric screening 
as well as merely to psychoneurotic screening. 

Finally, there was the basic question of whether the test would 
really work. Thus far we had only the criterion situation in which 
the test was developed, a study in which the psychoneurotic and 
nonpsychoneurotic groups were known, and distinguishing between 
them was a formal exercise rather than a practical necessity. It 
might be, for example, that the detection of the psychoneurotics 
and their removal to hospitals accounted for much of the differences 
in their scores and the scores of men generally. If this were true, 
then they could not have been detected by the test so successfully 
prior to hospitalization. Or, it might be that the general level of 
psychosomatic complaining changed from civilian to military life 
so that the cutting point derived from the Army sample to distin- 
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guish critical from noncritical scores would not, in the civilian set- 
ting, furnish a sharp enough discrimination between psychiatric 
ineffectives and others. 

The reliability of test scores. The test-retest reliability of the 
psychosomatic complaints score was measured by administering 
the test to groups of enlisted men on duty and of psychoneurotic pa- 
tients and retesting them a week later. As shown in Table 5, the 
test scores proved to be highly reliable; in fact, about 90 per cent of 
each group were classified consistently, that is, their scores fell on 
the same side of the cutting point both times. The actual correla- 
tions of test scores with retest scores were .93 and .90 within the 


TABLE 5 


Terst-Rerest Psycuosomatic Compiaintrs Scorgs 
AMONG PSYCHONEUROTIC AND NONPSYCHONEUROTIC ENLISTED MEN 


(April 1944) 
CLASSIFICATION OF: PROPORTION IN EACH CLASSIFICATION AMONG: 
Test score Retest score Enlisted men on duty Psychoneurotic patients 

Noncritical Noncritical 67.6% 21.8% 
Noncritica! Critical 3.0 4.0 
Critical Noncritical 6.4 6.9 
Critical Critical 23.0 67.3 

100.0% 100.0% 

Number 299 101 


nonpsychoneurotic and the psychoneurotic groups respectively. 
Indeed these relationships were so high that it was feared that they 
might be in part a result of carryover from the test to the retest 
situation, especially in view of the small period of time between 
them. But estimates of the lower limit of the reliability coefficient, 
using the Kuder-Richardson formulas, were comparably high, 
being .87 and .82 respectively. 

The effect of context. In order to check on the possible influence 
on test scores which asking the items as part of a larger question- 
naire in which they were embedded might have had, a form was 
prepared which placed the test items at the beginning of the ques- 
tionnaire so that the answers to them could not be influenced by 
preceding questions. This new form was administered in conjunc- 
tion with the original form to a small group of enlisted men on duty 
and another group of enlisted men who were psychoneurotic pa- 
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tients, a random half of each group filling out each form. As 
shown in Table 6, there were no significant differences attributable 
to the type of form used. 

The effect of anonymity. There was some question of whether 
signing their names would make men more or less cautious in claim- 
ing psychosomatic symptoms than they were under strictly anony- 
mous conditions. On the one hand it was argued that identifica- 
tion and the consequent possibility of checking up on their responses 
would make men more cautious in claiming symptoms. On the 
other, hand, identification and the implication that test scores would 
have § some bearing on one’s draft classification or disposition in the 


TABLE 6 


RxguaTion o¥ Presence ok ABSENCE OF CONTEXT TO PsycHOSOMATIC 
Comp.taints Scorrs AMONG ENLISTED MEN ON Dury AND 
PsycHONEUROTIC PATIENTS 


(April 1944) 
PROPORTIONS RECEIVING CRITICAL SCORES: 
Type of form answered Nonpsychoneurotics Psychoneurotics 
With context 33.8% (145) 74.0% (60) 
Without context 25.3 (184) 74.5 (61) 
Difference 8.5 —0.5 
Critical ratio of difference 1.60 0.02 


Army might encourage some men to exaggerate their symptoms. 
Obviously, if the use of an identified form made a difference, then 
the test standardized under another set of conditions would not 
necessarily perform adequately. 

In three studies, two made in the Army and one in an induction 
station, small but consistent differences were found between men 
answering the test anonymously and men asked to sign their names 
to their test forms. In each case, critical scores occurred somewhat 
less often among the identified men. The data from the induction 
station study in which random halves of a group of men being 
examined there filled out signed and anonymous forms, with no 
explanation of the purpose of the schedules, are shown in Table 7. 
These differences are not in themselves statistically significant, nor 
do they constitute sufficient replications to confirm the existence of 
a real tendency. But even if other studies would have confirmed 
the difference, it was of such a small order as to be of no practical 
importance. 
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The effect of change of situation and criterion. Up to this point, 
when we have spoken of distinguishing psychoneurotics from non- 
psychoneurotics our data have been derived from a comparison of 
men known or presumed to be psychoneurotic ineffectives (the 
criterion group) with a group known or presumed to be free of the 
criterion. This is, of course, a standard procedure in developing 
tests of this kind, but the crucial question on which such a test’s 


TABLE 7 


DIsTRisurion oF Psycuosomatic ComPLaints ScorEs AMONG 
ANONYMOUS AND IDENTIFIED PREINDUCTION EXAMINERS 


(March 1944) 


PROPORTIONS RECEIVING EACH SCORE PROPORTIONS RECEIVING GIVEN 
SCORE SCORE OR LOWER 
Anonymous Identified Anonymous Identified 
men men men men 
30 0.4% 2.5% 100.0% 100.0% 
29 3.4 3.8 99.6 97.5 
28 2.9 8.8 96.2 93.7 
27 7.1 5.9 93.3 84.9 
26 5.4 7.1 86.2 79.0 
25 4.6 6.3 80.8 71.9 
24 6.3 8.4 76.2 65.6 
23 8.3 6.7 69.9 57.2 
22 8.8 5.9 61.6 50.5 
21 7.6 5.5 52.8 44.6 
20 3.8 3.8 45.3 39.1 
19 7.1 3.8 41.5 35.3 
18 3.8 3.8 34.4 31.5 
17 5.4 4.6 30.6 27.7 
16 2.5 3.4 25.2 23.1 
15 2.9 4.2 22.7 19.7 
14 4.2 3.4 19.8 15.5 
13 2.9 2.1 15.6 12.1 
12 3.4 2.9 12.7 10.0 
ll 17 1.3 9.3 7.1 
10 17 0.4 7.6 5.8 
9 1.7 1.7 5.9 5.4 
8 2.1 0.4 4.2 3.7 
7 1.3 0.8 2.1 3.3 
6 0.0 17 0.8 2.5 
5 0.0 0.4 0.8 0.8 
4 0.0 0.0 0.8 0.4 
3 0.0 0.0 0.8 0.4 
2 0.0 0.4 0.8 0.4 
1 0.8 0.0 0.8 0.0 
0 0.0 0.0 0.0 0.0 
Total 100.0% 100.0% 


Number 238 289 
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usefulness depends is whether it could have achieved similar results 
when applied to an unselected population prior to its separation 
into the two components in which we are interested; viz., psycho- 
neurotics and nonpsychoneurotics. That is to say, for screening 
purposes our results must be translatable into the statement that if 
(roughly) the third of an unselected group of enlisted men who 
scored lowest on our test were to be examined, there would be found 
among this third of the group close to 90 per cent of all those in the 
entire group who would later become psychoneurotic ineffectives. 
This in turn implies that the test would lend itself to a practical 
screening procedure only if it could be shown that the scores ob- 
tained by psychoneurotic ineffectives were not appreciably affected 
either by their hospitalization or by developments between their 
induction and their hospitalization; in other words, that as large a 
percentage of them would have received critical scores while still in 
an ordinary milieu as did in the hospital setting. It was also of 
some relevance to the test’s practical usefulness to know how stable 
the scores of nonpsychoneurotics remained. If there were any 
appreciable shifts in their scores from civilian to military life, then 
the test might require restandardization in terms of the scores to 
be considered critical. 

Ideally, these crucial points should have been tested by a study 
in which a group of men being examined for induction were given 
the psychoneurotic test and then followed up after a reasonable 
length of time in their Army careers, say a year or two. It could 
then have been determined what had become of these men, and 
their earlier scores classified according to whether they had be- 
come psychoneurotic ineffectives or not. A comparison of the 
score distributions for the “failures’’ and ‘“‘successes’”’ would then 
have clearly indicated how effectively the test actually discrimi- 
nated before the fact. Moreover a retesting of these men would 
have permitted a determination of the direction and extent of score 
shifting accompanying the change from civilian to military life. At 
the time the NSA was being developed, however, the need for it or 
a similar test seemed urgent, and there simply was not time to 
undertake this definitive check. 

As an alternative to the procedure just outlined, there was the 


4% Such a study was later undertaken. A sample of about 5,000 men inducted into 
the Army at approximately the same time in different parts of the country was gathered 
and the preinduction NSA scores of the men were secured. The ending of the war 
rather soon afterwards, however, made the follow-up impractical. 
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faster though less exact technique of comparing the scores obtained 
in the samples of enlisted men studied with those of comparable 
groups of men prior to induction. But in order for comparability 
even to be approximated, it was necessary that the men tested 
prior to induction be separable into the criterion group of psycho- 
neurotic ineffectives and the contrast group of effectives. Yet the 
criterion was defined in terms of actually proving to be ineffective 
in the Army situation, a reaction which could not be determined 
prior to induction and service. Psychiatrists’ diagnoses were clearly 
the only substitute way of separating the criterion and contrast 
groups, but their use meant substituting for the original criterion 
of actually becoming ineffective, psychiatrists’ predictions that cer- 
tain men would prove to be psychoneurotic ineffectives if they were 
inducted. The two criteria would have been interchangeable 
only if it could be assumed that the diagnostic decisions psychi- 
atrists made on the basis of brief induction station examinations 
were completely accurate in terms of the original criterion, that is, 
that every man found by psychiatrists to be acceptable for service 
remained effective and that every man they rejected would have 
become ineffective if inducted. But the very fact that there were 
breakdowns and discharges of men once found acceptable indicated 
the less than perfect correspondence of the two criteria. In fact, 
one study made by the Surgeon General’s Office showed that the 
men inducted from an induction station where a high rate of psy- 
chiatric rejections prevailed (presumably, therefore, a rather selected 
population psychiatrically) later had as high a rate of discharges 
for psychiatric defects as a presumably less selected group of men 
inducted from a neighboring station which had a low rate of psychi- 
atric rejections. This study, as well as others showing a rather high 
survival rate in the Army for groups of men who would ordinarily 
have been rejected by psychiatrists, suggested that there might 
even be relatively little relationship between the two criteria. 
On the other hand, though the Army’s goal was to increase effi- 
cient use of man power both by reducing rejections of men who 
would have succeeded in the Army and by decreasing the inductions 
~ # Tt should be remembered that it was far easier to decide whether or not 8 man 
should be diagnosed psychoneurotic than it was to decide whether he should be re- 
jected for this reason, especially when not too much was known about the psychological 
factors making for ineffectiveness or survival in the Army. Many psychiatrists there- 
fore tended to regard any psychoneurotic as a bad risk for the Army. Others tended 
to assume that psychoneurotics who had developed minimally adequate adjustments 


to their civilian situation could also do so in military life, and so to recommend their 
induction. Thus, errors of judgment could be made on both sides. 
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of men who would become ineffective, it is certain that any test 
which did not reproduce rather well the psychiatrists’ induction 
station judgments would have been impossible of adoption, even if 
it did discriminate well with reference to the criterion of ineffective- 
ness which the psychiatrists were predicting. The reason for this 
is obvious: it must be recalled that the test scores themselves did 
not lead directly to disposition; rather, they peeled off a fairly siz- 
able group, among whose number the psychoneurotics to be rejected 
were to be discovered by psychiatric examination. The standards 
of judgment which psychiatrists would use in examining this sub- 
group would naturally be those they employed prior to the intro- 
duction of the test. If the diagnostic criteria implicit in the test 
were not rather close to those ordinarily employed by psychiatrists, 
then they would not in fact find a disproportionate number of 
psychoneurotics in the screened subgroup and would, therefore, 
have no confidence in the test. And, even if a screening test simply 
reproduced the behavior of induction station psychiatrists without 
in any way improving on their efficiency in economizing man power, 
it at least had the virtue of saving psychiatrists’ time, which was 
itself critically scarce. 

At any rate, a small trial of the test was undertaken at one induc- 
tion station and certain conclusions were drawn from it. In this 
study, Research Branch personnel administered the test to a ran- 
dom group of men being processed and then removed the tests from 
the station. Psychiatrists examined the men in the usual fashion, 
without knowledge of either the men’s test scores or even the items 
in the test itself. These independent diagnoses were then collected 
and the men’s test scores were compared with them. The basic data 
from this trial are shown in Table 8. 

As we see here, when critical scores were divided from noncritical 
scores at the same point as was arrived at by analysis of the Army 
data, then 18.0 per cent of the men found acceptable for service 
received critical scores as compared with 81.8 per cent of the men 
rejected for psychiatric reasons. It will be recalled that, in the 
Army study, 26.8 per cent of the cross section and 89.6 per cent of 
the hospitalized psychoneurotic patients had received critical 
scores. At first glance, then, the test was about as successful in 
screening with reference to the original criterion. 

In comparison with the induction station data, however, the 
Army data seemed to indicate that there was a general downward 
shifting of scores in the Army milieu. That is to say, men already 
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TABLE 8 


Psycuosomatic CompLaints Scorgs oF CANDIDATES FOR INDUCTION, 
Cuiassirrmp BY Disposrrion: ONE INDUCTION STaTION 


(March 1944) 


PROPORTIONS RECHIVING EACH SCORE PROPORTIONS RECEIVING 
GIVEN SCORE OR LESS 


se ih, ¥, ih 


ae 
= tes ¢ 
STE EE 


3 F383 
ge® e2S ¢ ees 
si ss Sek Sek BE Sos Se 
30 2.5% 3.7% 0.0% 0.0% | 100.0% 100.0% 100.0% 100.0% 
29 3.8 4.8 2.9 0.0 97.5 96.3 100.0 100.0 
28 8.8 9.7 14.7 0.0 93.7 91.5 97.1 100.0 
27 5.9 7.8 3.0 0.0 84.9 81.8 82.4 100.0 
26 7.1 7.2 11.8 2.6 79.0 74.0 79.4 100.0 
25 6.3 7.2 8.8 0.0 71.9 66.8 67.6 97.4 
24 8.4 9.1 11.8 2.6 65.6 59.6 58.8 97.4 
23 6.7 9.1 0.0 2.6 57.2 50.5 47.0 94.8 
22 5.9 6.0 8.8 2.6 50.5 41.4 47.0 92.2 
21 5.5 4.8 14.7 0.0 44,6 35.4 38.2 89.6 
20 3.8 4.8 0.0 2.6 39.1 30.6 23.5 89.6 
19 3.8 4.2 3.0 2.6 35.3 25.8 23.6 87.0 
18 38 3.6 5.9 2.6 31.5 21.6 20.5 84.4 
17 4.6 5.4 0.0 5.3 27.7 18.0 14.6 81.8 
16 3.4 3.0 2.9 5.3 23.1 12.6 14.6 76.5 
15 4.2 3.0 2.9 10.6 19.7 9.6 11.7 71.2 
14 3.4 1.8 3.0 10.6 15.5 6.6 8.8 60.6 
13 2.1 1.8 0.0 5.3 12.1 4.8 5.8 50.0 
12 2.9 0.6 2.9 13.2 10.0 3.0 5.8 44.7 
11 1.3 0.0 2.9 5.3 7.1 2.4 2.9 31.5 
10 0.4 0.0 0.0 2.6 5.8 2.4 0.0 26.2 
9 1.7 0.6 0.0 7.9 5.4 2.4 0.0 23.6 
8 0.4 0.6 0.0 0.0 3.7 1.8 0.0 15.7 
7 0.8 0.6 0.0 2.6 3.3 1.2 0.0 15.7 
6 1.7 0.6 0.0 7.9 2.5 0.6 0.0 13.1 
5 0.4 0.0 0.0 2.6 0.8 0.0 0.0 5.2 
4 0.0 0.0 0.0 0.0 0.4 0.0 0.0 2.6 
3 0.0 0.0 0.0 0.0 0.4 0.0 0.0 2.6 
2 0.4 0.0 0.0 2.6 0.4 0.0 0.0 2.6 
1 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 
0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 


Total 100.0% 100.0% 100.0% 100.0% 


Number 
of cases 238 166 34 88 


* Includes men rejectable for both physical and psychiatric reasons. 
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in the Army scored less favorably on the average than men accept- 
able for service but not yet in the Armed Forces. The difference 
between the proportions receiving critical scores was the more re- 
markable because the men already in the Army had been subjected 
to more psychiatric selection than the potential inductees, through 
the breakdown and hospitalization of some of its original numbers. 
In so far as the military patients could be assumed to have come 
from among the small proportion of men who received critical scores 
but were inducted,” we would expect that the Army group would 
appear to have better scores than the induction station group, even 
if there had been complete stability of individual scores. Instead 
we found a difference in the other direction. 

Taken by themselves, these data would appear to suggest that an 
upward revision of the highest score regarded as critical at induc- 
tion stations was in order. However, these data were derived from 
only one induction station, and there was no way of knowing how 
representative they would be of other stations in other parts of the 
country.24_ Moreover, though the percentage of critical scores 
among men acceptable for induction appeared lower than the pro- 
portion among men already in the Army, as far as could be told 
from the small number of cases available, the same was true of the 
proportion of psychiatric rejections screened at induction stations 
in comparison with the figure among psychoneurotics hospitalized 
in the Army. Yet, despite the generally upward shift of score 
distributions for both acceptable men and unacceptable men at 
induction stations in comparison with the Army data, these two 
distributions intersected at the same scale score for the induction 
station data as they had in the Army data. The point of intersec- 
tion is, of course, the most efficient cutting point to use in separating 
critical from noncritical scores, though it need not be used if some 
other point better implements the objective of the test. 

In view of the coincidence of the cutting points in the two studies, 
it was decided that for the time being, at least, no revision of cutting 
point should be made. This reasoning was supported by at least 
two further considerations. First, since the data for psychiatric 
rejections consisted of only thirty-eight cases, it was felt that the 


® This is, of course, the crucial and untested question which makes it impossible to say 
definitively whether the test screened well in practice with reference to the criterion of 
future ineffectiveness. 

= Evidence gathered later confirmed the reality of the apparent shift in scorea, but 
this information was not available during the developmental phases of the test. 
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general shape of the distribution of these scores, as reflected in the 
point of intersection, was more reliable than the exact percentage 
distribution. That is to say, no matter where the cutting point 
had been set, the percentage of psychiatric rejections being screened 
at that point could not be determined with any reliability, so it was 
not very useful to attempt to raise the cutting point to the level at 
which some particular percentage was screened. Second, the results 
obtained were based on the application of a test for psychoneurotics 
to all psychiatric rejections. After extensive discussion with psy- 
chiatrists, the conclusion was reached that questions aimed more 
explicitly at the detection of psychotics and psychopaths would 
have to be added. Since this revision of the test would change the 
score distributions anyway, there was little to be gained from fixing 
on any particular cutting point until the new distributions were 
available, so the original cutting point was continued. 

Questions added to detect “critical signs.’’ The questions finally 
added in the interest of extending the test’s psychiatric applicability 
were eight in number.??. These were not combined with the orig- 
inal fifteen items which continued to be scored as we have described, 
but were treated as qualitative. Possession of a critical sign, as an 
affirmative response to any of these questions was designated, was 
considered sufficient reason for referral to the psychiatrist for 
evaluation, regardless of whether the numerical score was critical 
or not. Thus, the final test score consisted of two parts: the nu- 
merical score and a set of symbols to indicate what critical signs 
had been claimed by the respondent. Men with scores critical in 
either respect (i.e., a low numerical score or the possession of one 
or more critical signs) were to be considered screened by the test. 

In this form the test was named the Neuropsychiatric Screening 
Adjunct (NSA).* It was officially adopted for use at all induction 


™ The exact questions, to be answered by a simple ‘‘yes” or “‘no,” were: 

. Have you ever had stomach ulcers? 

. Do you ever take dope? 

. Have you ever had fits or convulsions since you were ten years old? 

. Did you ever have a nervous breakdown? 

. Were you ever 8 patient in a mental hospital (because of your nerves)? 

. Were you ever sent to reform school? 

. Have you ever gotten into serious trouble or lost your job because of drinking? 
. Do you ever wet the bed? (This means urinate in bed, not wet dreams.) 

% The naming of the test was a historical accident, which need not concern us here, 
except to point out that the test obviously had no bearing on the detection of the neuro- 
logically unfit, who may, for our purposes, be regarded as a subcategory of the physical 
rejections, 
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stations beginning in October 1944.% The procedures outlined 
for its use called for its administration to all men passing the mini- 
mum literacy test, and the psychiatric examination of those who 
were screened by it, i.e., received critical scores and/or critical signs. 
The psychiatric examination of men who took the test and were 
not screened by it was left discretionary, but it was intended that 
until its use had been routinized and a further check of its operation 
could be made, all men be seen by the psychiatrists regardless of 
test scores. As it turned out, the extreme need for the NSA to 
relieve the pressure on psychiatrists had diminished by the time of 
its introduction, and it was never considered necessary to make full 
use of the test to pass certain men without individual examinations. 


PART @ 


QUESTIONS USED IN INITIAL SURVEY AND THEIR 
DISCRIMINATING UTILITY 


Comparisons are shown here of responses of 3,501 white enlisted 
men with no overseas service and 563 psychoneurotic patients in 
Army hospitals, in January-February 1944.25 ‘No answers’ are 
omitted. 

The attitude data are, of course, men’s subjective reports and 
may not be accurate measures of the actual incidence of the be- 
haviors the questions refer to. 

The items cover the fifteen areas described earlier in this chapter. 


1. BACKGROUND 


How old were you on your last birthday? 
Neurotic Cross 
patients section 


30 and over 40% 22% 
25-29 27 25 
CSS ee ee ee ee 
21-24 22 35 
Under 21 11 18 


100% 100% 


% See War Department Memorandum No. 40-44 (19 September 44) ‘Psychological 
Examining for Neuropsychiatric Tendencies.” 

% 8-99. 

* This line marks the cutting point at which questions were divided for the purpose of 
setting up dichotomized series. 
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How far did you go in school? 


College 
High school graduate 


Some high school 
Grade school 


Are you single, married, divorced, separated, or widowed? 


Single 


Married 
Divorced, separated, or widowed 


613 


Where did you live most of the time before you were sixteen years 


old? 


Farm 
Small town (under 2,500) 
Town (2,500-25,000) 


City (25,000-100,000) 
Large city (over 100,000) 
How long have you been in the Army? 
Over 2 years 
Over 1 year to 2 years 


Over 6 months to 1 year 
6 months or less 


How many brothers and sisters older than you do you have? How 


many brothers and sisters younger than you do you have? 


Only child 
Youngest child 


Intermediate child 
Oldest child 


Neurotic Cross 
patients section 
9% 16% 
15 2 
37 31 
39 21 
100% 100% 
Neurotic Cross 
patients section 
43% 61% 
52 36 
5 3 
100% 100% 
Neurotic Cross 
patients section 
29% 26% 
11 13 
14 18 
15 15 
31 28 
100% 100% 
Neurotic Cross 
patients section 
138% 20% 
41 49 
19 19 
27 12 
100% 100% 
Neurotic Cross 
patients section 
6% 8% 
22 21 
52 49 
20 22 
100% 100% 
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Did your parents always live together up to the time you were 


sixteen years old? 
Neurotic Cross 
patients section 
Yes 71% 77% 
No 
Death 17 15 
Divorce or separation 10 6 
Other 2 2 
100% 100% 


So far as you know, has anyone in your family ever had a nervous 
breakdown? (Do not include yourself if you think you have had a 
nervous breakdown.) 

Neurotic Cross 


pakents section 

No 25% 60% 
Don’t know 29 18 
Yes 46 22 

100% 100% 


2. CHILDHOOD RELATIONS WITH PARENTS 


Summary score for area 


A high score represents a feeling that parents were lenient and 
fair, that other children in the family were not preferentially treated, 
and a low score represents the opposite. 

Neurotic Cross 


Score patients section 
6, 5,4 16% 22% 
3 31 37 
2 30 26 
1,0 23 15 
100% 100% 
When you were growing up, who would you say was your mother’s 
favorite child? 
Neurotic Cross 
patients section 
Only child 8% 8% 
I was 15 17 
There were no favorites 54 56 
An older child 10 9 
A younger child 13 10 


100% 100% 


SCREENING OF PSYCHONEUROTICS 6168 


When you were growing up, who would you say was your father’s 
favorite child? 


Neurotic Cross 

patients section 

Only child 8% 8% 
I was jl 11 
There were no favorites 58 58 
An older child 11 1l 
A younger child 12 12 

100% 100% 


Do you think that your parents (or the people who brought you up) 
were stricter than other parents or not as strict as other parents? 


Neurotic Cross 


patients section 

Not as strict 15% 15% 
About as strict 67 69 
Stricter 18 16 

100% 100% 


When you were growing up, about how often were you punished by 
your parents (or the people who brought you up)? 
Neurotic Cross 
patients section 


Not often enough 17% 23% 
About the right number of times 72 74 
Too often 11 3 


100% 100% 


When your parents (or the people who brought you up) punished 
you, did you usually deserve it or not? 


Neurotic Cross 

patients section 
Always deserved it 50% 64% 
Deserved it most of the time 36 29 
Deserved it about half the time or less 14 7 


100% 100% 


When you had actually done something wrong and were punished 
for it, did you usually get an easier or a harder punishment than 
other kids you knew? 


Neurotic Cross 

patients section 
Easier 12% 14% 
About the same 71 73 
Harder 17 13 


100% 100% 
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3. CHILDHOOD FEARS 


Summary score for area 


In the area of childhood fears, a high score represents fewer fears 
reported in childhood of such things as snakes, punishment, being 


on high places, etc. 


Neurotic 
Score patients 
19 (no fears) 9% 
18-16 (1-3 fears) 29 
15-12 (4-7 fears) 31 
11-0 (8 or more fears) 31 
100% 
When you were a kid, how afraid were you of: 
Neurotic 
patients 
Strangers 
Not at all 54% 
A little 32 
Very much 14 
100% 
Being with girls 
Not at all 538% 
A little 33 
Very much 14 
100% 
Thunderstorms 
Not at all 44% 
A little 36 
Very much 20 
100% 
Being left alone 
Not at all 41% 
A little 35 
Very much 24 


Cross 

section 

25% 
43 
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Neurotic Cross 
patients _ section 
Large animals 
Not at all 43% 52% 
A little 35 37 
Very much 22 11 
100% 100% 
Being shut up in a room or closet 
Not at all 46% 62% 
A little 29 27 
Very much 25 11 
100% 100% 
Falling 
Not at all 32% 47% 
A little 35 42 
Very much 33 11 
100% 100% 
Walking by a graveyard at night 
Not at all 38% 51% 
A little 35 36 
Very much 27 13 
100% 100% 
The Devil 
Not at all 56% 65% 
A little 20 20 
Very much 24 15 
100% 100% 
Thoughts of death 
Not at all 45% 52% 
A little 28 33 
Very much 27 15 
100% 100% 
Being on high places 
Not at all 34% 49% 
A little 27 36 
Very much 39 15 


100% 100% 
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Family quarrels 


Not at all 
A little 


Very much 

Being punished 
Not at all 
A little 


Very much 


Being laughed at by other boys 


Not at all 
A little 


Very much 


Sharp knives 
Not at all 
A little 


Very much 


Getting bawled out 


Not at all 
A little 


Very much 


Being called on to recite in class 


Not at all 
A little 


Very much 


Getting a bad report card from school 


Not at all 
A little 


Very much 


Neurotic 
patients 


39% 
33 


28 


100% 


27% 
41 


32 


100% 
36% 
29 
35 

100% 


38% 
32 


30 


100% 


22% 
39 


39 
100% 


27% 
34 


39 


100% 


32% 
37 


31 
100% 
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Cross 
section 


49% 
34 


17 
100% 


32% 
51 


17 
100% 
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Neurotic 
patients 
Snakes 
Not at all 23% 
A little 24 
Very much 53 
100% 


4. CHILDHOOD NEUROTIC SYMPTOMS 


Summary score for area 


Cross 
section 


26% 
35 


The area of childhood neurotic symptoms is concerned with such 
things as general health, stuttering, stammering, nail biting and 
enuresis. A high score represents absence of these symptoms. 


Neurotic Cross 

Score patients section 

4 (no symptoms) 18% 46% 
3 (1 symptom) 29 34 
2 (2 symptoms) 28 15 
1, 0 (3 or 4 symptoms) 25 5 

100% 100% 

As far as you know, were you a healthy child or a rather sickly one? 

Neurotic Cross 

patients section 

Very healthy 21% 51% 
Fairly healthy 40 36 
Don’t know 7 3 
Rather sickly 32 10 

100% 100% 

Did you ever bite your fingernails when you were a child? 

Neurotic Cross 

patients section 

Never 40% 55% 
Sometimes 29 28 
Often 31 7 

100% 100% 
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When you were growing up, did you have any trouble with stut- 


tering or stammering in your speech? 


Never 


Sometimes 
Often 


Neurotic 
patients 
50% 


35 
15 


100% 


How old were you when you stopped wetting the bed? 


Less than 4 years old 
4-9 years 
10-16 years 


After 16 years 
Still wet the bed 


5, CHILDHOOD SCHOOL ADJUSTMENT 


Summary score for area 


Neurotic 
patients 
56% 


22 
13 


100% 


Cross 
section 


74% 


100% 


A high score indicates a liking for school and a reasonable degree 


of success there. 


Neurotic 
patients 


10% 
34 


29 
27 


100% 


As a child how did you feel about going to school? 


Really wanted very much to go 


Didn’t care very much whether I went 
or not 
Did not at all want to go 


Neurotic 
patients 
48% 


100% 


Cross 
section 
19% 
30 


32 
19 
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When you were a kid, how often would you say you played hookey 


from school (stayed away as much as a day)? 


Neurotic Cross 

patients section 

Never 27% 28% 
Only once or twice 33 40 
Several times 31 27 
Very often 9 5 

100% 100% 

What kind of grades did you usually get when you were in school? 

Neurotic Cross 

patients section 

Very high 4% 5% 
High 17 27 
Medium 62 64 
Low ll 3 
Very low 6 1 

100% 100% 


6. CHILDHOOD FIGHTING BEHAVIOR 


Summary score for area 


Good adjustment is defined as some degree of aggressiveness as 


indicated by getting into, holding one’s own in, and not being 
afraid of fights. 
Neurotic Cross 
Score patients section 
3 29% 43% 
2 26 27 
1 23 19 
0 22 11 
100% 100% 
How often did you get into fights when you were a kid? 
Neurotic Cross 
patients section 
Very often 21% 16% 
Sometimes 38 46 
Almost never 41 38 
100% 100% 


521 


522 


SCREENING OF PSYCHONEUVUROTICS 


How did you feel about fighting as a kid? 


Neurotic Cross 
patients section 
Really liked fighting 6% 12% 
Didn’t particularly like or dislike it 38 54 
Didn’t like fighting at all 56 34 
100% 100% 
Were you a good fighter as a kid? 
Neurotic Cross 
patients section 
Yes 22% 39% 
Undecided 36 
No 42 28 
100% 100% 
7. CHILDHOOD PARTICIPATION IN SPORTS 
Summary scores for area 
(a) Number of sports engaged in 
Neurotic Cross 
patients section 
6, 5, 4 8% 12% 
3, 2 30 48 
1 26 25 
0 38 15 
100% 100% 
(b) Type of sports engaged in 
Neurotic Cross 
patients section 
Both body-contact and non-body- 
contact sports 26% 49% 
Body-contact sports only 14 19 
Non-body-contact sports only 22 17 
None 38 15 
100% 100% 
As a boy (before you were 18 years old), how much did you take 
part in these sports? 
Neurotic Cross 
patients section 
Baseball 
A lot 31% 54% 
A little 43 36 
Not at all 26 10 
100% 100% 
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Neurotic Cross 
patients section 


Fishing or hunting 
A lot 39% 50% 
A little 29 31 
Not at all 32 19 
100% 100% 
Football 
A lot 18% 38% 
A little 30 37 
Not at all 52 25 
100% 100% 
Basketball 
A lot 18% 32% 
A little 32 42 
Not at all 50 26 
100% 100% 
Tennis 
A lot 7% 12% 
A little 20 32 
Not at all 73 56 
100% 100% 
Golf 
A lot 4% 7% 
A little 15 22 
Not at all 81 71 
100% 100% 


8. EMANCIPATION FROM PARENTS 
Summary score for area 

This score has two main components: the actual physical emanci- 

pation involved in leaving home and the psychic emancipation im- 

plied by establishment of satisfactory heterosexual relationships; 30 

a high score in this area represents good adjustment in both re- 


spects. 
Neurotic Cross 
Score patients section 
6, 5 13% 14% 
4,3 45 50 
2,1 29 29 
0 13 7 


100% 100% 
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How old were you when you started living away from your parents 
or the family that brought you up? 
Neurotic Cross 
patients _ section 


Less than 15 years old 10% 6% 
15-17 years 13 ll 
18-21 years 22 29 
22 years and over 18 14 
Never lived away 37 40 
100% 100% 


Before you came into the Army, what was the longest you had ever 
been away from your family or anybody related to you? (By 
“family” is meant your parents, brothers and sisters, or people 


who brought you up.) 

Neurotic Cross 
patients section 
1 year or more 27% 26% 

1 month to 1 year 29 34 

1 week to 1 month 19 21 

Less than 1 week 15 12 

Never away 10 7 
100% 100% 


Before you came into the Army, what was the longest you had ever 
been away at any one time from your parents or the family that 
brought you up? (If you left home to get married, count the time 
you were away after you were married.) 


Neurotic Cross 

patients section 

1 year or more 35% 31% 
1 month to 1 year 26 34 
1 week to 1 month 18 21 
Less than a week 15 9 
Never away 6 5 

100% 100% 


How old were you when you first started going around by yourself 

on dates with girls (not on double dates or in large groups)? 
Neurotic Cross 
patients section 


Less than 15 years old 7% 7% 
15-16 years 20 30 
17-18 years 33 38 
19-20 years 20 15 
21 years or over 12 6 
Never dated 8 4 


100% 100% 
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Did you usually have dates with girls more often or leas often than 


most other fellows of about your own age that you knew? 


Neurotic Cross 

patients section 

More often 138% 13% 
About the same 45 61 
Not as often 42 26 

100% 100% 

How old were you when you first started going steady with one girl? 

Neurotic Cross 

patients section 

Less than 18 years old 16% 20% 
18-20 years 35 40 
21-23 years 19 16 
23 years or more (ls 8 
Never went steady 15 16 

100% 100% 


9. MOBILITY 


Summary score for area 


A high score represents a stable employment record, regular 


church attendance and little geographical moving around. 


Neurotic 
Score patients 
4 13% 
3 32 
2 35 
1,0 20 
100% 


Cross 
section 


14% 
37 


36 
13 


100% 


Before you came into the Army, how many different states of the 


United States had you been in to live or work? 


Neurotic 

patients 
One 45% 
Two 23 
Three 11 
Four 7 
Five or more 14 


100% 


Cross 
section 
43% 


21 
13 

8 
15 


100% 


526 


SCREENING OF PSYCHONEUROTICS 


How many jobs did you have in the last two years before you came 
into the Army? 


Neurotic Cross 

patients section 
One 45% 47% 
Two 29 32 
Three 10 12 
Four or more 12 6 
Never had a job 4 3 


100% 100% 


Before you came into the Army, had you ever been fired or asked to 
resign from a job? 

Neurotic Cross 

patients section 

No 78% 89% 


Never had a job 4 3 
Yes 18 8 
100% 100% 


Before you came into the Army, about how often did you go to 
church? 
Neurotic Cross 
patients section 


Once a week or more 34% 39% 
One to three times a month 26 29 
Several times a year 22 16 
Almost never 18 16 
100% 100% 


10. socrIABILITY 


Summary score for area 


A high score represents a liking for and association with other 
people. 


Neurotic Cross 
Score patients section 
3 22% 43% 
2 33 41 
1 26 12 
0 19 4 


100% 100% 
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How would you say the people you know feel about you? 


Neurotic 
patients 
Almost all of them like me 46% 
Most of them like me 39 
A few of them like me 12 
Almost none of them like me 3 
100% 


Before you came into the Army did you usually go around 
bunch of others or by yourself? - 


Neurotic 
patients 
With a bunch of others 18% 
With just one or two others 49 
By myself 33 
100% 


Cross 
section 


53% 


43 
3 
1 


100% 
with a 
Cross 
section 
38% 
52 
10 
100% 


On the whole, before you came into the Army did you usually like 


to be by yourself or to be with other people? 


Neurotic 
patients 
With other people 44% 
Undecided 15 
By myself 4\ 
100% 


11. IDENTIFICATION WITH THE WAR EFFORT 


Summary score for area 


Cross 
section 


78% 


7 
15 


100% 


Identification with the war effort is defined as a feeling that the 
war is worth fighting and a recognition of one’s own personal stake 
in it, with a high score representing identification and a low score, 
lack of identification. 


Neurotic Cross 
Score patients section 
5 43% 49% 
4 26 27 
3,2 20 18 
1,0 ll 6 


100% 100% 
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Do you think you have as much of a personal stake in this war as 
anybody else? 


Neurotic Cross 

patients section 
Yes 68% 7% 
I think so but I’m not sure 13 9 
Undecided 12 7 
No 7 7 


100% 100% 


In your opinion is the United States fighting for things that you feel 
are worth fighting for? 


Neurotic Cross 

patients section 

Yes 84% 89% 
I think so, but I’m not sure 6 5 
Undecided 6 4 
No 4 2 

100% 100% 


Do you think it would make much difference to you personally if we 
did not win this war? 


Neurotic Cross 

patients section 

Would make a great deal of difference 85% 94% 
Quite a bit 10 5 
Not so much 4 1 
Not any 1 0 

100% 100% 


Do you ever get the feeling that this war is not worth fighting? 


Neurotic Cross 
patients section 


Never 64% 65% 
Only once in a great while 17 18 
Sometimes 13 12 
Very often 6 5 
100% 100% 


Do you think the things we are fighting for are worth risking your 
life for? 

Neurotic Cross 

patients section 

Yes 74% 83% 


I think so, but I’m not sure 13 9 
Undecided 8 5 
No 5 3 
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12. ACCEPTANCE OF SOLDIER ROLE 


Summary score for area 


A high score in the area of acceptance of soldier role represents a 
feeling that being drafted was fair in one’s own case, a choice of the 
job of a soldier in preference to one as a war worker, and a desire 
for overseas service. 


Neurotic Cross 
Score patients section 
4,3 16% 46% 
2 25 23 
1 32 20 
0 27 11 
100% 100% 
At the time you came into the Army did you think you should have 
been deferred? 
Neurotic Cross 
patients section 
I wasn’t drafted 18% 25% 
No 38 50 
Yes 
Health 35 9 
Dependents 5 8 
Job 3 6 
Other 1 2 


100% 100% 


If it were up to you to choose, do you think you could do more for 
your country as a soldier or as a worker in a war job? 

Neurotic Cross 

patients section 

Soldier 10% 40% 


Undecided 18 15 
War worker 72 45 
100% 100% 
If it were up to you and you yourself had to decide, would you choose 
to be a soldier or a civilian? 
Neurotic Cross 


patients section 
Soldier 21% 44% 


Undecided 23 13 
Civilian 56 43 


100% 100% 
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If it were up to you, what kind of outfit would you rather be in? 


Neurotic Cross 
patients section 


Combat overseas 35% 49% 
Noncombat overseas 16 21 
An outfit that will stay in U.S. 49 30 


100% 100% 


13. WORRYING 


Summary score for area 


The worrying score, as its name implies, measures the extent to 
which a person worries about himself and the future, with a high 
score indicating a relatively unworried person. 


Neurotic Cross 

Score patients section 

4 25% 50% 
3 26 25 
2 24 15 
1,0 25 10 

100% 100% 


Do you worry very much about things that might happen to you? 


Neurotic Cross 

patients —_section 
Not at all 16% 26% 
Not very much 33 55 
Quite a bit 27 14 
A great deal 24 5 


100% 100% 


When you get a very important job to do, do you worry about 
whether you will do all right or not? 


N eurotic Cross 


patients section 
Not at all 12% 18% 
A little 38 56 
A lot 50 26 


100% 100% 
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Do you ever worry about whether you will be injured in combat 
before the war is over? 
Neurotic Cross 
patients section 


Never 39% 41% 
Hardly ever 29 36 
Fairly often 22 17 
A great deal 10 6 


100% 100% 


Do you ever seriously worry about whether or not there will be a 
real depression after this war? 


Neurotic Cross 

patients section 

Almost never 37% 36% 
Sometimes 41 43 
Very often 22 21 

100% 100% 


14. OVERSENSITIVITY 
Summary score for area 


The area of oversensitivity is concerned with such things as being 
easily offended, resentful of criticism, irritable, etc., a high score 
indicating an absence of the traits grouped together as oversensi- 


tivity. 

Neurotic Cross 
Score patients section 
10, 9,8 29% 58% 

7,6 25 28 

5, 4 23 13 

3, 2, 1, 0 23 6 
100% 100% 


Do you often say things you later wish you hadn’t said? 


Neurotic Cross 
patients section 


Almost never 20% 28% 
Sometimes 54 60 
Very often 26 12 


100% 100% 
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How often do people get on your nerves so that you want to do just 
the opposite of what they want you to do? 


Neurotic Cross 

patients section 
Almost never 23% 37% 
Sometimes 46 50 


Very often 31 13 


100% 100% 


Do you ever feel that people in the Army criticize you too much 
without any good reason? 
Neurotic Cross 
patients section 


Almost never 28% 42% 
Only once in a while 34 41 
Fairly often 21 11 
Nearly all the time 17 6 
100% 100% 


How often does it make you sore to have people tell you what to do? 


Neurotic Cross 
patients section 


Almost never 2% 29% 

Sometimes 47 53 

Very often 29 18 
100% 100% 


Do you ever go out of your way to make things tough and un- 
pleasant for somebody you don’t like? 


Neurotic Cross 
Y section 
Almost never 74% 81% 

Sometimes 22 17 

Very often 4 2 
100% 100% 

Do you ever feel like smashing things for no good reason? 

Neurotic Cross 


Almost never 43% 11% 


Sometimes 37 25 
Very often 20 4 


100% 100% 
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How much does it bother you when you are ordered to do some- 


thing you don’t see a good reason for doing? 


Neurotic Cross 
patients section 
Not at all 10% 9% 
Very little 17 24 
Some 34 37 
A great deal 39 30 
100% 100% 
Did people seem to think you had a hot temper as a kid? 
Neurotic Cross 
patients section 
No 40% 58% 
Undecided 16 10 
Yes 44 32 
100% 100% 
Do you find that you often have to tell people to mind their own 
business? 
Neurotic Cross 
patients section 
Almost never 32% 51% 
Sometimes 48 42 
Very often ' 20 7 
100% 100% 
How often do people hurt your feelings? 
Neurotic Cross 
patients section 
Almost never 20% 44% 
Sometimes 52 50 
Very often 28 6 
100% 100% 


15. PERSONAL ADJUSTMENT 


Summary score for area 


In this area are grouped questions dealing with self-confidence, 
and lack of self-pity, depression and anxiety. A high score repre- 


sents possession of the traits just enumerated. 


Neurotic 
Score patients 
6, 5, 4 8% 
3, 2 25 
1 23 
0 44 
100% 


Cross 
section 


33% 
37 


18 
12 


100% 


534 


SCREENING OF PSYCHONEUROTICS 


In general, how would you say you feel most of the time, in good 
spirits or in low spirits? 
Neurotic Cross 
patients section 


Good spirits 11% 37% 
Good spirits sometimes; 

low spirits sometimes 44 50 
Low spirits 45 13 


100% 100% 


Are you ever worried and upset? 


Neurotic Cross 

patients section 

Hardly ever 11% 37% 
Sometimes 34 48 
Often 55 15 

100% 100% 


Do you feel that you get more than your share of bad luck? 


Neurotic Cross 
patients section 


Almost never 19% 32% 

Sometimes 51 52 

Very often 30 16 
100% 100% 


In general, what sort of time do you have in the Army? 


Neurotic Cross 
patients section 


Pretty good 14% 24% 

It’s about fifty-fifty 48 57 

Pretty rotten time 38 19 
100% 100% 


Do you ever get so blue and discouraged that you wonder whether 
anything is worth while? 


Neurotic Cross 

patients section 
Hardly ever 15% 36% 
Not so often 23 35 
Pretty often 31 17 
Very often 31 12 


100% 100% 
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Do you think you can make good in the Army? 


Neurotic Cross 
patients section 


Very sure I can make good 14% 49% 
I think so but I’m not sure 12 19 
Undecided 31 20 
Don’t think I can make good 43 12 


100% 100% 


16. PSYCHOSOMATIC COMPLAINTS 


Summary scores in this area are graphed in Chart I in Part 1. 
(1) Do you have any particular physical or health problem?™ 


Neurotic Cross 

patients section 

2. No 8% 57% 
3. Undecided 7 8 
1. Yes 85 35 

100% 100% 


(2) Do you often have trouble in getting to sleep or staying asleep? 


Neurotic Cross 

patients section 
3. Almost never 7% ~~ 48% 
2. Sometimes 33 39 


1. Often 60 13 


100% 100% 


(3) Do your hands ever tremble enough to bother you? 


Neurotic Cross 
patients section 
3. Never 20% 68% 


2. Sometimes 39 25 
1. Often 41 7 


100% 100% 


% The number in parenthesis, referring to the question number, and the numbers 
opposite the subcategories will facilitate subsequent cross reference from Tables 9 and 
10 in Part 3. 
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(4) Have you ever had any fainting spells? 


3. Never 38% 78% 
2. A few times 41 18 
1. Several times 21 4 
100% 100% 
(5) Are you ever bothered by nervousness? 
Neurotic Cross 
patients section 
3. Never 5% 35% 
2. Sometimes 25 48 
1. Often 70 17 
100% 100% 


(6) Have you ever been bothered by your heart beating hard? 
Neurotic Cross 


3. Never 14% 46% 
2. A few times 41 41 
1. Often 45 13 
100% 100% 
(7) Have you ever been bothered by pressures or pains in the head? 
Neurotic Cross 
patients section 
3. Never 16% 50% 
2. Sometimes 37 37 
1. Often 47 13 
100% 100% 


(8) Have you ever had spells of dizziness? 
Neurotic Cross 
patients section 


3. Never 10% 44% 
2. A few times 45 45 
1. Many times 45 ll 


100% 100% 
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(9) Do you ever bite your fingernails now? 
Neurotic Cross 
patients section 


3. Never 47% 68% 
2. Sometimes 30 22 
1. Often 23 10 


100% 100% 


(10) Have you ever been bothered by shortness of breath when you were 
not exercising or working hard? 
Neurotic Cross 


patients 
3. Never 19% 57% 
2. Sometimes 41 31 
1. Often 40 12 
100% 100% 


(11) Are you ever troubled by your hands sweating so that they feel 
damp and clammy? 
Neurotic Cross 


patients section 

3. Never 10% 39% 
2. Sometimes 33 ee) 
1. Often 57 18 

100% 100% 


(12) Are you ever troubled with sick headaches? 
Neurotic Cross 
patients section 
3. Never 14% 52% 


2. Sometimes 44 38 
1. Often 42 10 
100% 100% 


(13) How often are you bothered by having an upset stomach? 
Neurotic Cross 
patients section 

4, Never 5% 22% 
37 


3. Not very often 59 
2. Pretty often 38 15 
1. Nearly all the time 20 4 


100% 100% 
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(14) Are you ever bothered by having nightmares (dreams that frighten 
or upset you very much)? 

Neurotic Crosa 

patients section 

3. Never 20% 64% 


2. A few times 49 32 
1. Many times 31 4 


100% 100% 


(15) Have you ever been troubled by ‘‘cold sweats”? 
Neurotic Cross 
patients _ section 
3. Never 20% 58% 


2. A few times 54 37 
1. Often 26 5 


100% 100% 


PART 8 


METHOD OF DERIVING WEIGHTS 


The method used in deriving weights from the multiple regres- 
sion of the criterion on the fifteen items in the initial test battery 
was an iterative technique.” From the point of view of sampling 
error, this method has an advantage over the Doolittle and related 
methods in that the iterations could be stopped at any point where 
it was apparent that chance was entering too much. Furthermore, 
it was far easier to compute. The basic data are presented in 
Table 9. In this table each entry represents a joint occurrence of 
two categories, that is, the proportion who answered the two indi- 
cated categories simultaneously. Thus a diagonal entry repre- 


7 See Louis Guttman, “An Iterative Method for Multiple Correlation,” in P. Horst, 
et al., The Prediction of Personal Adjustment (Social Science Research Council, New 
York, 1941), pp. 313-318. 

% A word should be said about how the cross section and the psychoneurotic samples 
were weighted in constructing this table. As was indicated earlier, the proper weighting 
to maximize the discrimination was one which would equalize the variances of the two 
groups’ prediction scores. Since on other scorings of the psychosomatic complaints, 
the scores of the psychoneurotic group were noted to have smaller total variances than 
those of the cross section, the former group was given the heavier weight. It was esti- 
mated, without bothering to solve the equations for the proper weights, that a ratio of 
9 to 1 would be about right, and this is the weighting used in Table 9. As it turned out, 
this was something of an overweighting of the psychoneurotic group, but it made little 
difference. The proper ratio should be less than 9 to 1 and more than 1 to 1, but would, 
in any case, not have given different results from those obtained, as far as total discrimi- 
nation is concerned. 
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sents the proportion answering the indicated category (since it 
always occurs with itself); two categories within the same question 
have a zero joint occurrence, since they are mutually exclusive ; and 
two categories from different questions have relative frequencies 
bigger than zero. The sum of each three-by-three square in the 
table, where the rows belong all to one question and the columns 
belong all to another question, is unity. 

For the first iteration, zero-order weights were used, that is, the 
least square weights that an item would be given if it were to be the 
only predictor. These are quite simple to compute for qualitative 
items, being simply the ratio of the joint occurrence of a category 
with the criterion to the total frequency of that category. Adding 
any constant to the weights of all the categories of a question simply 
changes each person’s total score by exactly that constant, leaving 
the correlation with the criterion unchanged. Therefore, to sim- 
plify further computations, a constant equal to the weight of the 
top category was subtracted from the category weights of each 
question, thus automatically reducing the weight of the top cate- 
gory to zero. The work from then on proceeded with Table 10, 
which is basically Table 9 with the categories with zero weight 
eliminated. In Table 10, a dummy variable was added to represent 
the additive constant for the regression equation; that is, everybody 
had a score of unity on the dummy variable, so its diagonal entry is 
unity, and its nondiagonal entries are simply the diagonal entries 
of the corresponding categories. 

If x;; represents the entry in row 7 and column j of Table 9 and if 
c; represents the entry in the 2th row of column C, then the multiple 
regression weights, w;, are defined to be those weights which satisfy 
the normal equations: 


Ci— ) Wai; = 0 
d 


The iterative process first tests the trial weights w1; to see to what 
extent they satisfy the normal equations. These weights are of 
course not in scale and must be divided by a constant. The column 
2Zw,z in Table 10 when multiplied by a constant” to yield the column 
kZw,z shows to what extent the zero-order weights are proportional 
to the multiple regression weights. It may be noted how close to 
zero are the entries in the last column of Table 10. If these were 


™ The constant employed is the slope of the line of best fit of the Zw,z’s to the c’s. 
In this case, k = .231. 
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TABLE 9 


Joint OccuRRENCES OF PsycHosomaTIC COMPLAINTS: 
Cross Sncrion aND PsycHongunotics WEIGHTED ONE To NINE AND COMBINED 


(January-February 1944) 


ea 
font ! 4 3 4 
Subcategory &£ $s 1 $s 2 1 $s 2 1 $s 8 1 
2 -127 -039 .050 .038 071 .035 .021 086 .032 .010 
1 3 073 009 .038 .026 018 .039 .015 -025 .031 .016 
1 .800 061 .244 .495 -187 301.342 -808 .322 .170 
3 -039 .009 .061 -109 .078 .022 .008 .072 .024 .013 
2 2 050 .038 .244 332 -100 .149 .083 -152 .134 047 
1 .038 .026 .495 .559 .069 .203 .288 -106 .227 .136 
3 -O71 .018 .157 .078 .100 .069 -247 153. .071 + .023 
3 2 035 .039 .301 .022 .149 .203 874 -135 166.074 
1 021 .015 .842 008 .083 .288 379 -180 .149 .000 
3 086 .025 .308 O72 .152 .196 -153 .135 = .130 418 
4 2 .032 .031 .322 024 .134 .227 .071 .166 .149 386 
1 010 .016 .170 013 047.136 023 .074 .090 196 
3 -038 .004 .036 039 wae -007 066 .011 .002 -061 .012 .005 
5 2 047 .039 .187 -041 .099 -119 .136 .019 -141 .090 .034 
1 042 .029 .578 029 “Tee 452 063 .228 .358 -217 .275 = .187 
3 048 .016 .105 -046 .061 .062 002 .048 .020 -102 .052 .015 
6 2 .051 .040 .319 042 .156 .211 100 .167  .142 -174 .178 .080 
1 -029 .017 .376 021 .115 .287 056 .159 .207 -143 156 
3 056 .011 .127 053 .084 .057 -009 .048 .047 -121 .059 .014 
7 2 -048 .024 . -036 .143  .187 -087 .160 .119 -157 148 =.060 
1 024 .038 .379 020 .1 315 061 .166 .213 140 .179 .122 
3 041 045 .063 .026 074 026 110 .017 .006 
8 2 038 .353 9 .175 .226 116 6185) 148 212 .192 047 
1 026 .026 .865 015 5 .307 057.156 8 .177 43 
3 078 .0: 379 074 .185 .227 -172 7 «187 9 .168 .069 
9 2 038 .022 023 .098 .173 052 .116 .1 102. .123 
1 O11 .021 .188 O12 .049 .159 023 .082 .116 8 059 
3 062 .019 .148 065 .080 .084 116 .073 .039 140 .064 .026 
10 2 040 .031 .320 028 .144 .228 073 .175 .152 175 157) = .066 
1 025 .022 .322 016 .108 .247 058 .127 .186 102 .164 .104 
3 046 .015 .068 -037 .055 .037 075 .034 .020 072 .035 .022 
11 2 048 .025 049 .127 .161 097 .154 157.125 .055 
1 033 .032 .470 150 .862 074 .188 .273 190 .225 .120 
3 054 .010 .115 057 .078 .044 101 .045 .032 123.041 .014 
12 2 0. 038 .342 164 .234 184.157 188 .174 2 
1 017 .026 .344 015 281 052 .146 107. 170.3110 
4 021 .005 .044 .024 .024 .023 036 .022 .012 049 .014 .007 
13 3 073 .032 .285 -055 .176 .160 141.143 .108 .187 .144 .060 
2,1 033 .036 .470 030 .133 .376 071 .210 .257 -183 227.128 
3 060 .019 .163 070 .098 .075 -131 .077 .036 -188 .062 .022 
14 2 -048 .031 .398 032 .184 .260 103.198 .175 -193 .200 .075 
1 019 .022 .240 007 .050 .224 013.100 .167 067 .115 .000 
3 065 .019 .153 061 .099 .078 115 .078 .044 183 .062 .022 
15 2 052 .045 .422 041 .183 .205 -107 .225 = .187 -211 .210 .008 
1 010 .009 .224 -007 .050 .187 025 .072 .147 054 .118 077 


*® The exact wordings of questions and subcategories are shown in Part 2 of this chapter. In that part, 
the questions here referred to by number are indicated by a parenthetical number preceding the ques 
tion. The subcategories in this table follow the numbering used in Part 2. 
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TABLE 9 (Continued) 


ton 6 6 7 8 
Subcategory s 8 1 3s #2 1 s os 1 s ¢ 1 
2 038 .047 .042 .048 .051 .029 056 .048 .024 .041 .060 .026 
1 3 .004 .039 .029 .016 .040 .017 .011 .024 .038  .009 .038 .026 
1 .036 .187 .578 .105 .819 .876 .127 .204 .379 .082 3853 .365 
3 -039 .041 .029 046 .042 .021 -053 .036 .020 045 .049 .015 
2 2 082 .134 .166 O61 .156 .115 084 .143 .106 063 .175 .005 
1 007 .090 .452 062 .211 .287 -057 .187 315 026 .225 .307 
3 .066 .119 .063 092 .100 .056 -099 .087 .061 -074 .116 .057 
3 2 O11 .136 .228 048 .167 .150 -048 .160 .166 .033  .185 .156 
1 002 .019 .358 .029 .142 .207 047 .119 .213 026 .148 .204 
3 061 .141 .217 -102 174 = .143 121. .157) .140 110.212 .098 
4 2 O12 .090 .275 052 .178 .156 -059 .148 .179 017. .192 177 
1 005 .034 .157 015 .059 .123 014 .060 .122 .006 .047 .143 
3 078 045 .023 .010 .049 .016 .013 041 .026 .011 
5 2 .273 .072 .1386 .066 063.133 .078 = «052s «151.071 
1 649 .052 .251 .846 .082 .217 .349 041 .273 .334 
3 -045 .072 .052 -169 -074 .054 .040 .051 .085 .032 
6 2 é 136.251 409 -087 184 .188 8 =6.051  .233)_.126 
1 010 .066 .346 422 .033  .126 .262 .082 .132 .259 
3 049 .063 .082 .074 .087 .033 104 083 .086 .025 
7 2 016.1383 .217 .054 .184 .126 366 038 .210 .118 
1 013 .078 .349 040 .138 .262 440 012 .154 .275 
3 041 .052 .041 051 .051 .032 -083 .038 .012 133 
8 2 .026 .151 .273 -085 .233 .132 -086 .210 .154 450 
1 -011 071 .334 .082 .126 .259 025 .118 .275 A417 
3 065 .159 .261 104 .207 .175 -127 .180 .180 085 .225 .177 
9 2 009 .081 .204 048 .111 .135 041 .110 .143 036 .120 .129 
1 003 .034 .183 .017 13000 027, «076.117 012 097 .112 
3 051 .088 .087 -058 .086 -057 067 .100 .052 
10 2 021 .119 .260 -057  .199 .145 163 .175 047 .202 .151 
1 006 .067 .298 .025 .126 .219 047 .117 .207 019 .138 .213 
3 035 .058 .036 .058 027 .058 .037 038 .061 .030 
11 2 030 .126 .180 058 .173 .105 072 .148 .117 O57 .163 117 
1 013 .090 .432 054 .191 .289 064 .181 039 .226 .270 
3 048 .073 .057 063 .074 .041 -105 .053 .020 077 .074 .027 
12 2 022 .143 070 .186 .179 075 .242 .119 047 .240 .147 
1 008 .057 .322 .037 .149 1 015 .070 .302 009 .135 .242 
4 020 025 029 .028 .013 -038 .018 .014 027 .032 .011 
13 3 046 .148 .197 000 .164 .137 -095 .138 .158 O73 .195 .124 
21 012 .101 .426 050 .218 .271 -061 269 033 3.282 
3 054 .111 .077 O81 .101 .061 -087 .087 .068 O77 .119 .046 
14 2 132 .324 -216 .185 -080 6 6 .194 
1 003 .031 .247 014 .002 .174 017 166 010 .004 .177 
3 045 .098 .004 077 .099 .061 -088 062 072 .115 .050 
15 2 028 337 082 .239 .198 -080 .216 .224 O51 .258 .209 
I 022 .217 010 .072 .163 -026 .064 .154 010 .076 .157 
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TABLE 9 (Continued) 


ee- 
Pb 9 10 il 12 
Subcategory 3 2 1 $s 2 1 s 2 1 8 & 1 
2 078 .038 .O11 062 .040 .025 046 .048 .033 -054 .056 .017 
1 3 -029 .022 .021 -019 .031 .022 -015 .025 .032 .010 026 
1 -879 .233  .188 +148 329 .322 .068 .263 .470 115.342 1344 
3 074 .023 .012 065 .028 .016 .037 .049 .023 057 .037 .015 
2 2 -185 .098 .049 080 .144 .108 055 .127 .150 -078 .164 .090 
1 -227 173.150 -084 .228 .247 037 .161 .862 044 .234 .281 
3 +172 .052 + .023 -116 .073 .058 .075 .007 .074 -101  .004 .052 
3 2 -177° .116 .082 -073 .175 .127 034 .154 .188 045 1184 .146 
i +137) 125.116 -039 .152 .186 020 .085 .273 -032 .157 .189 
3 -249 .102 .068 140 .175 = .102 072 .157 .190 -123 188.107 
4 2 -168 .123 .004 -064 .157 .164 -035 .125 .225 041.174 .170 
1 -069 .069 .059 -026 .066 .104 022 .055 .120 014 .072 .110 
3 065 .009 .003 -051 .021 .006 035 .030 .013 -048 .022 .008 
5 2 -150 .081 .034 -088 .110 .067 -058 .126 .090 073.143 .057 
1 -261 .204 .183 -080 .260 .208 -036 .180 .432 .057 .270 .322 
3 -104 .048 .017 .087 .057 .025 058 .058 .054 -063 .070 .037 
6 2 -207 .111 .002 -084 .199 .126 -044 .173 .191 O74 .186 .149 
1 175.185 ~.110 058 .145 .219 027.105 .289 041 .179 .201 
3 +127) .041 027 : -062 .047 058 .072 .064 105.075 .015 
7 2 -180 .110 .076 -086 .163 .117 -037 148.181 053.242 .070 
1 -180 .143) 117 -087 .175 .207 ‘ +117 .290 -020 .119 
3 085 .036 .012 -067 .047 .019 -038 .057 .039 077 .047 .009 
8 2 +225 .129 .097 -108 .202 .138 061 .163 .226 074 .240 .135 
1 177) «120.112 -052 .151  .213 030.117 .270 027 147.242 
3 486 -160 .193 143 O87 .179 .221 116.221. 150 
9 2 204 -049 .129 .116 026 .104 .164 040.131 .123 
1 -220 -030 .078 .112 017.053 .150 022 .083 .114 
3 +150 .049 .030 -229 066 .085 .078 091 .083 .054 
10 2 -193 .129 .078 400 039 .156 205 052 .201 .148 
1 +1438) 116 112 371 .024 .005 .252 -035 .152 .185 
3 -087 .026 .017 .066 .039 .024 -129 056 .056 .017 
11 2 +179 .104 = .053 085 .1566 .005 836 068 .155 .113 
1 -221 .164 .150 078 .205 .252 -535 055 .224 .257 
3 -116 .040 .022 001 .052 .035 056 .068 .055 178 
12 2 +221 131 .083 083 .201 .152 056 .155 .224 435 
1 150° .123 114 054 .148 .185 O17 113.257 387 
4 047 .010 .013 036 .022 .013 031 .027 .012 038 .021 O11 
13 3 -206 .118 .068 «111.170 = .110 -067 .158 .167 096 .100 .104 
21 -232 .165 .140 -082 .200 .248 032 .150 .356 044.223 .271 
3 -159 .053 .031 -111) .085 = .047 072 .092 .079 091 .101 .051 
14 2 +243 .138) .095 O91 .225 .161 045 .185 .246 072 .228 177 
1 -084 .103 .004 -027 .002 .162 011 .060 .210 015 .106 .160 
3 -162 .053  .032 -103 .081 .053 079 .087 .071 002 .095 .050 
15 2 -250 .165 .104 -000 .246 .174 039 .214 .266 063 .255 .201 
1 .084 .076 .083 .027 .073 .144 011 .035 .198 022 .086 .136 


SCREENING OF PSYCHONEUROTICS 


TABLE 9 (Continued) 


Zero-order 
weights 

Joint Ratio Cor- 

occur- of oc- rected 
rence cur- to have 

with rence sero 

crt- with origin 

gue ee & 

rion 
fon 1s 14 15 — 
o- 
Subcategory 4 8 8,1 $s 2 1 $s 2 1 quency 

2 .021 .073 .033 -060 .048 .019 .065 .052 .010 -070 -553 -000 

1 3 .005 .032 .036 .019 .031 .022 -019 .045 .009 -067 911 358 
1 .044 .285 .470 163 .388 .240 «153.422 .224 -763 954 401 

3 .024 .055 .030 -070 .032 .007 -061 .041 .007 061 555 .000 

2 2 -024 .176 .133 .098 .184 .050 -099 .183 .050 203 884 .329 
1 .023 .160 .376 .075 .260 .224 .078 .295 .187 546 976 421 

3 -036 .141 .071 -131 .103 .013 «115 .107 .025 179 724 .000 

3 2 -022 .143 .210 -077 .198 .100 -078 .225 .072 349 933 209 
1 .O12 .108 .257 .036 .175 .167 044 .187 .147 372 982 258 

3 -049 .187 .183 -158 .193 .067 -153 .211 .054 .340 814 .000 

4 2 -014 .144 .227 .062 .209 .115 -062 .210 .113 -367 953 -139 
1 .007 .060 .128 022 .075 .099 .022 .098 .077 193 980 166 

8 -020 .046 .012 -054 .020 .003 .045 .028 .005 -042 544 .000 

5 2 026 .148 .101 -111 .132 .031 -008 .154 .022 -226 -826 -282 
1 .025 .197 .426 .077 .324 .247 -004 .337 .217 632 -974 430 

3 -029 .080 .050 -081 .075 .014 .077 .082 .010 123 731 -000 

6 2 .028 .164 .218 -101 .216 .002 -099 .239 .072 368 899 168 

1 -013 .137 .271 -061 .185 .174 -061 .198 .163 409 968 . 

3 .038 .095 .061 -087 .090 .017 -088 .080 .026 144 741 .000 

7 2 .018 .138 .209 -087 .180 .099 -086 .216 .064 329 800 -159 
1 .014 .158 .269 -068 .206 .166 -062 .224 .154 427 970 -229 

3 -027 .073 .033 .077 .046 .010 -072 .051 .010 .090 674 .000 

8 2 -032 .195 .223 -119 .236 .004 -115 .258 .076 405 -800 -226 
1 -011 .124 .282 -046 .194 .177 -050 .209 .157 405 972 -208 

3 -047 .206 .232 -159 .243 .084 -152 .250 .084 A18 861 -000 

9 2 -010 .118 .165 -053 .138 .103 -053 .165 .076 272 925 064 
1 -013 .068 .140 -031 .095 .094 .032 .104 .083 210 953 .002 

3 -036 .111 .082 -111 .091 .027 -103 .099 .027 172 750 -000 

10 2 .022 .170 .209 -085 .225 .002 081 .246 .073 370 024 174 
1 -013 .110 .248 -047 .161 .162 -053 .174 .144 358 967 217 

3 -081 .067 .032 .O72 .045 .O11 -079 .039 .O11 000 697 000 

ll 2 .027 .158 .150 -092 .185 .060 -087 .214 .035 -293 874 177 
1 -O12 .167 .356 -079 .246 .210 .071 .266 .198 517 -965 268 

3 -038 .096 .044 .091 .072 .015 -092 .063 .022 -126 -709 -000 

12 2 -021 .190 .223 -101 .228 .1 -095 .255 .086 -398 914 205 
1 -O11 .104 .271 .051 .177 .160 -050 .201 .136 376 072 263 

4 -070 -048 .019 .003 -O41 .025 .005 .049 691 -000 

13 3 892 -119 .210 .062 123 .221 .048 -332 848 -157 
22 -538 .076 .247 .216 -074 .278 .191 -619 065 274 

3 -048 .119 .076 243 -106 .105 .031 -179 738 .000 

14 2 -019 .210 .247 476 -112 .288 .076 445 034 -106 
1 .003 .062 .216 -281 -019 .125 .137 -276 -983 245 

3 -041 .123 .074 -106 .112 .019 -237 179 755 -000 

15 2 -025 .221 .273 -105 .288 .125 519 482 928 178 
1 005 .048 .191 -031 .076 .137 -244 239 081 -226 


TABLE 10 


DERIVATION OF REGRESSION WeiGHTs FROM JOINT OCCURRENCES OF PSYCHOSOMATIC 


Compiaints: Cross SECTION AND PsycHONEUROTICS WEIGHTED ONE TO NINE AND COMBINED 
(January-February 1944) 


Dummy 
Question vartable i} 4 6 & 
Sub- 
cate- 
gory 3 1 2 I 2 I 2 I 2 1! a ! 2 1 2 f 2 1 
Dummy ° 
rariable 1.000 073 800 .332  .559 .374 379 386.196 273 «649 408° 422 366.440 450.417 -294 .220 
I 3 073 073 038 .026 039 O15 031.016 .039 .029 040 017 024 .038 -038  .026 022 O21 
1 .800 400 2244 495 301 .342 822 170 187.578 319° 376 .294 379 -353 365 .233 188 
2 2 B32 -03% .244 1832 -149 083 134.047 134.166 5B WS -143  .106 W750 095 098 O49 
1 2009 026 A95 559 .203  .288 227.136 099 .452 2211 287 AST 315 .225 307 173.159 
3 2 B74 -039 301 A49 .203 B74 166.074 136.228 167.159 -160 .166 L185 1156 116.082 
1 379 O15 342 O83 288 379 149.099 019.358 142.207 AIO 1213 148.204 125 116 
4 2 386 O31 322 134.227 1660 149 2386 009.275 WR 156 W& .179 192 177 123.004 
1 196 O16 .170 047 136 O74 099 -196 34.157 059.128 O60 .122 047.143 069 .OAD 
5 2 273 039 187 -134  .099 136.019 099.034 273 136.066 133.078 ASL 071 O81 .034 
1 649 029.578 166 452 228.358 275 157 649 251.346 L217) 349 273° 334 -204 183 
6 2 409 010.319 1560 211 167.142 178 059 136.251 409 184.138 -233 126 LLL v2 
1 422 O17 376 ALS 287 159.207 156 123 068 346 422 126.262 132.259 135.110 
7 2 366 024.204 143.187 160.119 -148  .060 133.217 kd 126 366 210) 118 110.076 
1 440 0388  .379 106.315 AWB 213 179.122 O78 .349 B38 .262 440 154.275 14300117 
8 2 450 038.353 ATS .225 “185.148 192 O47 -151) .273 -233 132 -210) 154 450 129° 007 
l AIT 026.365 095 .307 156.204 ATT 143 O71 .334 1260.25 ALB 275 17 129.112 
9 2 204 .022  .233 098 173 WWG 125 123.069 O81 .204 VW 135 110.143 129.129 294 
1 220 02 -188 -049 159 O52 116 .094 .059 034.183 092.110 076 1117 097 112 220 
10 2 400 O31 329 144.228 A775 152 157 .066 119.260 199 145 163.175 -202 LAL -129° 078 
1 37h -022 322 108 .247 227 186 .164 .104 067.208 A226 210 117.207 A138 = 213 ANG 12 
1 2 2336 4025 263 127) 161 US4 O85 125 .035 126.180 173.105 148.117 16300 .117 -104  .053 
1 LABS O82 470 150.362 ASS .27¢ 225 .120 090 432 AQ] 6289 -IS1) .290 .226 270 -164 150 
12 2 2435 O38 342 164.234 -1840 157 174 072 143.270 XG 1179 .242 119 .240 .147 ASL .O83 
1 BRT 026 344 090.281 146 .1S9 170.110 057.322 149.208 070 302 135.242 A230 114 
13 3 -392 032 285 A776 .160 -143 £108 -144 .060 -148 197 164.137 -138  .158 105 124 -118 .068 
21 2938, 038 470 133.376 -210 257 227.128 101.426 218 TE -209  .269 :223° .282 165.140 
14 2 ATO -031 398 184.260 18 175 -209° O75 1320 324 6216 185 -180 .206 236.194 -138 005 
1 -281 022 240 050.224 100.187 115 .099 O81 247 092 174 098 .166 094 177 103.094 
15 2 L519 OS A422 LTRS 205 225 187 -210  .008 154.337 2230 198 -216  .224 -258 .200 -165  .104 
1 244 O08 224 050.187 O72 147 W130 O77 022.217 O64 154 O76 
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TABLE 10 (Continued) 


Dummy 
Question variable 10 uw 12 18 14 16 Cow, Lw,2 k0ow,2 o—kiwyz 
Sub- ~ ae gn 
cate- 
gory 2 1 2 1 2 1 3 21 2 1 2 1 
Dummy 
rartable 1.000 400 .371 336.5385 0.4350 387 -392° 538) 4760 1281 5190 244.200 .900 3.862 892 .008 
1 3 073° .O3L 022. 025 0382. 088) 026) 032) 036.0381) 022.045) 009.087) 358 0.275 064 003 
1 SOO | 329.3220 2638 A700 B42 R44 285 470 398 .240 9.422 224 763) A401 3.246 750 -O13 
2 2 3320 144.108) 1270 150) 164 090) 178.1830 184.050 183) 050.293) 329 1.210 .280 -013 
1 O09 228-247 161) 380200 2340 281 ASO 3760 200) 2240 2950 S87 4G 421 2.330 .550 — 004 
3 2 BTA TS 127 A540 188 IS4 .14600 1430 2100 19% = .100) 6225) 072349209 1.499 346 -003 
1 B79 152 E86 08S 62738 157 ISX 108.257 A75 1167 187) 1470 372.258 1.643 378 - 007 
4 2 386.157) 164 01250 2250 174 T7044 227 209.115 .210 113) 367) £139 1.585 .366 -001 
1 -196 066 104.055.120.072) 110 080128) .075) 099.098) 077 193.166 0.852 197 — 004 
5 2 273 119 067) 126 «000.143 057) 148 101182) 031) 154.022.226.282 0.934 216 .010 
1 649.260 298) 180) 4820 270) B22 197) 4260 B24 247) 837) 217) 632-430 2.760 648 —~ .006 
6 2 409.190) 61260 173) 191186 149-164 21S) 216) 092.230) 072.388.1608 1.567 -362 006 
1 A220 145 219 105 280179 OL ATT EBS) 174198) 1630 409.237 1,802 416 — .007 
7 2 366.1630 117) 148) 181.242 070) «13% =—.209 180) .099) 216.064.320.159 1.409 +325 -004 
1 A400 175 207) 117) 290) 119.302) «158.269.206.166 6224 154 427 )=«—.229 1.888 436 — 009 
8 2 450.202.138.163) 2260240 135195) £223) 2386 «0946258 076 = .405 226 1.703 -393 -012 
1 AIT ASE 21300 117) 270 N47) 24200 124 2820 194 177) 209.157) 405.298 1,805 417 —.012 
9 2 294 1200-116 104 1640 13E 1230188 1656188) 6103) 6165) 078 272.064 1.184 273 — 001 
1 2200 078 112) 053.150 6083) 114.068 «=.140) 095 «004.104 =—.083) 210.092 0.941 217 — .007 
10 2 400  .400 57 205.201) 148 170) 200.225 «092.246 073) 370) 174 1.583 366 004 
1 371 871 005) 6.2520 152 185 120.248 61 1620174 1440 358.217 1.587 367 — .009 
ll 2 336.156.095.336 +1550 .213) 158.150.185.060) 2140 035.203) 177 1.238 286 .007 
1 535  .205  .252 535 9.224 2570167) 356.246 £210 266.198) 517 268 2.269 524 -- 007 
12 2 435.201) 6152 0155 224.436 .190 .223) .228 .106) = .255 086.308) .205 1,690 .390 -008 
1 «387.148 185 113.257 387 A104 .271 177.160) .201) 136.376.2263 1.680 388 -- .012 
13 3 392.170) 110.158 167) 190.104 392 +210 .062  .221 048 =.332 157 1.390 321 O11 
21 538.209.248.150) 356.223) 271 538.247) 216.273) 191.519.2274 2.285 -528 — .009 
14 2 A760 2250-161 185 2460 228) 177-210 247 AT6 -2838 .076 .445 .196 1.886 436 -009 
1 -231 « «.092  .162 060 «1210 S106 60) 062.216 281.125) 1387) .276) 1245 1.250 250 —.013 
15 2 S19 2460-174 214) 266.255 201221) 273) 288) 125i 19 A482 .173 2.060 476 006 
1 244 073 144 .0385 .198 086 .136 .048 .191 -076 .137 .244 230.226 1.082 250 — Ob 
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exactly zero, then the w: would be precisely proportional to the 
multiple regression weights. As it is, they are so close to being 
proportional that no further iterations were tried. Little gain in 
discrimination could have resulted from continuing the iterative 


TABLE 11 


A Tsst oF THE PROPORTIONALITY OF THE SIMPLE TRICHOTOMOUS 
WEIGHTING oF PsycHosomMatic ComPLaINts IreEMs To THEIR 
Moutiets REGRESSION WEIGHTS 


Ques- Subcate- 

tion gory Cc Twat kZw,r* c—k iwe 

Dummy .900 19.389 873 027 

‘ 3 067 1.293 058 009 

1 763 16.619 748 015 

; 2 293 5.641 254 039 

1 546 12,592 567 —.021 

- 2 349 7.366 331 018 

1 1372 8.884 .400 —.028 

; 2 .367 8.047 362 005 

1 193 4.720 212 —.019 

‘ 2 226 4.194 .189 037 

1 632 14.554 655 —.023 

‘ 2 368 7.871 341 027 

1 .409 9.697 436 —.027 

. 2 329 6.803 306 023 

1 427 10.153 ABT —.030 

, 2 405 8.149 .367 038 

1 405 9.769 440 —.035 

3 2 272 6.074 273 —.001 

1 210 5.166 232 —.022 

- 2 .370 7.743 348 022 

1 358 8.586 386 —.028 

re 2 293 5.861 264 029 

1 517 12.022 541 —.024 

a 2 398 8.191 369 029 

1 376 9.100 410 —.034 

2 3 332 6.564 295 037 

21 519 12.042 542 —.023 

i 2 445 9.255 416 029 

1 276 6.936 312 —.036 

2 482 10.181 458 024 

15 1 239 6.036 272 —.033 
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process, while sampling error might have been increased. But the 
only conclusion possible from the fact that the iterative process 
could not be carried beyond the first iteration was that differential 
weightings of the items would not yield any substantial improve- 
ment in discrimination over that obtained by simple weightings. 
As a further test of this conclusion, the simple trichotomous weights 
were used for the first iteration of the data in Table 10, and they 
also proved to be almost proportional to the multiple regression 
weights. The data basic to this conclusion are shown in Table 11. 
(The weights here are in reverse order to those used elsewhere in 
Chapters 13 and 14, with a weight of 2 given the subcategory which 
occurred most disproportionately among the psychoneurotics.) 

In summary, then, almost identical results were obtained when 
three different methods of weighting the items in the psychosomatic 
complaints score were tried: 


PERCENTAGES RECEIVING CRITICAL BCORES 


Method of weighting used Cross section Peychoneurotic sample 
Simple dichotomous (0, 1) 28.6% 89.4% 
Simple trichotomous (0, 1, 2) 26.8 89.6 
Differential trichotomous 28.6 90.8 


Considerations deriving from the fact that these items formed a 
quasi scale led us to believe that the simple scorings approximated 
the results to be obtained by employing multiple regression tech- 
niques, and an empirical investigation of the interrelationships 
confirmed this assumption. In view of the almost identical results 
obtained, the simple trichotomous weights were selected for prac- 
tical use, since they were the simplest to score. 


CHAPTER 14 


THE SCREENING OF PSYCHONEUROTICS: 
COMPARISON OF PSYCHIATRIC 
DIAGNOSES AND TEST SCORES 

AT ALL INDUCTION STATIONS’ 


N JULY 1945, all induction stations in the United States were 
instructed to keep a daily record of the NSA scores of all men 
receiving preinduction physical examinations during the month of 
August 1945, the men being classified into one of seven groups 
according to disposition. These categories were a major division 
of men into those found acceptable and those rejected for military 
service, the rejections being divided by cause into administrative, 
medical and physical, and psychiatric rejections. Within the latter 
group, a four-way division into major diagnostic categories also 
was secured, namely, psychoneurotic, psychopathic, psychotic, and 
all others. Thus records on the disposition of over one hundred 
thousand cases were obtained. 

It must be kept in mind that the NSA was not intended to be a 
substitute for psychiatric examination. It was rather to be an 
adjunct or aid to psychiatric examination. Men were to take the 
test and the examining physician would have the test results in 
front of him to aid him in his diagnosis. If aman made a favorable 
score on the test, with no “critical signs,’’ as described in the previ- 
ous chapter, the examination, it was hoped, could be even more 
perfunctory than normally, thus giving the already overburdened 
physician more time for a somewhat more thorough than average 
examination of the doubtful cases screened by the test. In situa- 
tions of unusual time pressure because of the volume of inductees, 
he might, at his discretion, dispense with examination in those 
cases who made favorable numerical scores on the test and showed 

1 By Shirley A. Star. 

? Instructions were given in ASF Circular No. 281 (24 July 1945). 
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no “critical signs.” Actually, during August 1945, the month in 
which the war ended, the rush of inductees was not great and, as 
far as is known, physicians nowhere exercised this option. How 
much they actually paid attention to the NSA scores we do not 
know. In spite of the fact that the test went out under the auspices 
of the Neuropsychiatric Division of the Office of the Surgeon Gen- 
eral and was administered by trained psychologists, it would not be 
surprising if by a good many clinicians it was either treated with 
tolerant amusement or completely ignored, even though their own 
clinical examination hardly could be more than perfunctory in the 
majority of cases. 

The fact, however, that the test scores were available to exami- 
ning physicians, who were free to use them or discard them, means 
that there is a lack of experimental independence in our comparison 
of NSA scores and psychiatrists’ judgments. In developing the 
test, as reported in the preceding chapter, experimental independ- 
ence was obtained. When the test was tried out at induction sta- 
tions, with the quite satisfactory results reported in the previous 
chapter, predictions were made about the psychiatrists’ disposition 
of the cases, in a situation where psychiatrists had no knowledge of 
the test scores. In the nationwide study now being reviewed, we 
are in no position to say how much a knowledge of the test scores 
influenced the correlation between test scores and psychiatrists’ 
ratings. But we do know that at all stations, irrespective of test 
scores, all men were given a personal psychiatric examination in 
addition to taking the test and the relationship between the test 
score and the clinical disposition is a matter which should be of very 
considerable scientific interest, even though the one may have in- 
fluenced the other. 

In order to evaluate the results, we shall look first at the facts as 
to diagnosis and disposition by examining physicians. Next we 
shall relate these diagnoses to the NSA scores made by the men so 
diagnosed. Finally, we shall make some special observations on 
the value of “critical signs’”’ to supplement the NSA score. 


Diagnosis and Disposition of Cases by Examining Physicians, 
August 1945 
Table 1 shows that throughout the United States, in August 1945, 
57.7 per cent of all literate preinduction examinees were found 
acceptable for service on medical grounds. The psychiatric rejects 
constituted 14.0 per cent, of whom the largest proportion were 
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diagnosed as psychoneurotic and the next largest proportion as 
psychopathic.® 

However, as Table 2 makes abundantly clear, there was extremely 
wide variation in the psychiatric rejection rates. The range, among 
the fifty-five induction stations, was from 0.5 per cent at Camp 
Beale, California, to 50.6 per cent at Manchester, New Hampshire. 
Some of the percentages of rejections shown in Table 2 are based on a 
rather small number of cases, but even if we compare pairs of rela- 
tively large stations in the same geographical region we see that 
the variability is great. 


TABLE 1 


CLASSIFICATION OF LITERATE PREINDUCTION EXAMINBES 
in Avuacust 1945 


Classification Per cent in each category 
Acceptable for service 57.7% 


Rejected 
Administrative rejections 1.2% 
Medical and physical rejections (excluding 
psychiatric) 27.1 


Psychiatric rejections (including rejections for 
both psychiatric and other reasons) 


Psychoneurotic 5.6% 

Psychopathic 8.9 

Psychotic 0.3 

Other psychiatric 4.2 14.0 42.3 


100.0% 


Not only was there wide variation in the psychiatric rejection 
rates, but also there was wide variation in the specific diagnoses 
given for these psychiatric rejects. This is shown in Table 3. 
While, in the nation as a whole, 39.9 per cent of all psychiatric re- 
jects were diagnosed as psychoneurotic, the percentages varied, 
among stations with at least 50 rejects, all the way from 2.7 to 90.2. 
There were 29 stations in which the dominant tendency was to 
classify men as psychoneurotic, 16 in which the psychiatric rejec- 
tions were most often psychopathic, and 10 in which they were 
usually called by neither of these labels. 

It might be argued, by way of explaining such enormous varia- 
bility in diagnosis, that the statistics in Tables 2 and 3 represent a 
faithful picture of the actual incidence among the populations 

* Only data on literate inductees are considered in this chapter, since men who failed 


to pass a standard literacy test administered by the psychologists were not given the 
NSA, though they were of course examined by physicians. 
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TABLE 2 


Disrosirion OF PREINDUCTION EXAMINEES, CLASSIFIED BY 
Serviczs Commanp aNp INDucrion StaTion 


(August 1945) 
DISPOBITION 

IN PERCENTAGES 

Accepted Rejected 

3 
SERVICE COMMAND 3 3 

AND 3 Ss ~~ 

INDUCTION STATION & 38 & 

—E 358 

~ Re 
Total 57.7 1.2 27.1 
First Service Command 57.7 1.2 19.8 
Boston (Fort Banks), Mass. 61.0 0.9 19.5 
Portland (Fort Preble), Me. 55.3 3.3 23.6 
Manchester (Grenier Field), N.H. 36.3 2.0 11.1 
New Haven, Conn. 60.2 0.1 22.3 
Rutland, Vt. 63.5 2.4 15.7 
Springfield, Mass. 64.1 0.5 24.0 
Second Service Command 51.3 0.3 36.4 
Albany-Syracuse, N.Y. 61.5 0.1 28.9 
Buffalo-Rochester, N.Y. 60.6 0.3 29.0 
Newark-Camden, N.J. 69.1 — 21.6 
New York, N.Y. 43.2 | 0.3 43.0 
Third Service Command 53.3 0.3 30.1 
Baltimore, Md. 55.6 — 24.0 
Harrisburg, Pa. 50.0 0.6 30.2 
Philadelphia, Pa. 51.5 0.3 40.8 
Pittsburgh, Pa. 47.3 0.2 28.3 
Richmond, Va. 71.3 0.7 23.1 
Roanoke, Va. 54.9 0.1 23.7 
Wilkes-Barre, Pa. 57.1 — 33.7 
Fourth Service Command 60.6 4.5 18.8 
Camp Blanding, Fla. 69.3 12 17.8 
Camp Shelby, Miss. 53.2 6.4 22.1 
Fort Benning, Ga. 70.7 — 17.2 
Fort Bragg, N.C. 64.0 2.5 24.5 
Fort Jackson, S.C. 55.6 12.7 140 
Fort MacPherson, Ga. 64.0 3.2 14.8 
Fort McClellan, Ala. 63.5 19 22.1 
Fort Oglethorpe, Ga. 59.3 1.7 18.3 
Fifth Service Command 62.2 0.5 23.5 
Cleveland, Ohio 63.4 0.1 21.4 
Columbus (Fort Hays), Ohio 66.7 0.1 19.8 
Huntington, W. Va. 66.3 2.3 22.4 
Indianapolis, Ind. 52.6 — 33.5 
Louisville, Ky. 61.7 — 21.0 


561 


TOTAL 

Per 

cent Number 
100.0 107,885 
100.0 6,707 
100.0 2,185 
100.0 617 
100.0 666 
100.0 1,690 
100.0 870 
100.0 429 
100.0 15,262 
100.0 1,411 
100.0 1,668 
100.0 2,690 
100.0 9,488 
100.0 16,828 
100.0 2,987 
100.0 1,789 
100.0 8,483 
100.0 4,687 
100.0 1,878 
100.0 920 
100.0 1,689 
100.0 13,148 
100.0 1,180 
100.0 1,487 
100.0 470 
100.0 1,817 
100.0 2,648 
100.0 1,211 
100.0 1,481 
100.0 8,004 
100.0 13,856 
100.0 2,768 
100.0 2,720 
100.0 2,819 
100.0 2,692 
100.0 2,472 
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TABLE 2 (Continued) 


DISPOSITION TOTAL 
IN PERCENTAGES 


Accepted Rejected 


SERVICE COMMAND 2 3 2 
AND 3 = ae c 
3 
INDUCTION STATION & § 8 2 
& 38 $ Per 
<= SE & cent Number 
Sizth Service Command 63.5 0.9 21.8 13.8 }/ 100.0 10,606 
Chicago, Ill. 71.2 1.8 19.4 7.6 || 100.0 4,628 
Detroit, Mich. 52.6 * 25.8 21.6 || 100.0 4,285 
Milwaukee, Wis. 69.8 0.8 18.3 11.1 || 100.0 1,848 
Seventh Service Command 55.0 * 28.0 17.0 || 100.0 11,780 
Fort Leavenworth, Kans. 53.9 — 30.4 15.7 || 100.0 4,848 
Fort Logan, Colo. 54.9 0.3 28.6 16.2 |} 100.0 1,220 
Fort Snelling, Minn. 66.5 — 19.4 141 | 100.0 $,129 
Jefferson Barracks, Mo. 44.7 * 33.4 21.9 || 100.0 3,083 
Eighth Service Command 65.7 0.4 23.4 10.5 } 100.0 8,381 
Camp Joseph T. Robinson, Ark. 65.2 0.1 26.5 8.2 |] 100.0 1,089 
Dallas, Texas 70.2 0.2 23.5 6.1 || 100.0 1,458 
Fort Bliss, Tex. 67.9 2.7 23.9 5.5 |} 100.0 402 
Fort Sam Houston, Tex. 62.8 0.6 24.0 12.6 || 100.0 948 
Houston, Tex. 69.1 0.2 22.5 8.2 |} 100.0 1,066 
New Orleans, La. 50.0 1.3 27.8 20.9 || 100.0 868 
Oklahoma City, Okla. 70.2 * 19.4 10.4 || 100.0 1,888 
Santa Fe, N.M. 66.4 — 19.5 14.1 }/ 100.0 241 
Shreveport (Fort Hamburg), La. 58.5 — 27.6 13.9 || 100.0 431 
Ninth Service Command 55.1 2.6 34.1 82 1] 1000 12,887 
Butte, Mont.; Boise, Ida.; 

Spokane, Wash. 50.9 0.9 42.8 5.4 |} 100.0 1,268 
Camp Beale, Calif. 95.3 19 2.3 0.5 {| 100.0 214 
Fort Douglas, Utah 55.0 2.4 36.6 6.0 || 100.0 816 
Fort Lewis, Wash. ‘ 73.5 9.5 13.2 3.8 || 100.0 &$ 
Fort MacArthur, Calif. 70.5 2.8 16.2 10.5 |} 100.0 142 
Los Angeles, Calif. 56.8 1.7 34.1 7.4 || 100.0 8,028 
Phoenix, Aris. 47.6 0.2 45.6 6.6 || 100.0 672 
San Francisco, Calif. 55.4 6.5 32.7 5.4 || 100.0 8,226 
Seattle, Wash.; Portland, Oreg. 52.3 0.2 32.0 15.5 || 100.0 2,978 

* Less than 0.05%. 


drawn into these induction stations. This argument would be 
easier to support if the stations within a given region had somewhat 
the same rates and if the variability within regions was much less 
than the variability between regions.‘ But when Pittsburgh had 


‘ Actually, the F-test shows that the variability was somewhat greater between re- 
gions than within regions, with results significant only at the .05 level. 
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TABLE 3 


Peycaiatric DiagNosss oF MEN ReEsEcTED FoR PsycHiaTRIc Reasons, 
CLassiFiIgp BY Service CoMMAND AND INDUCTION STATION 


(August 1945) 


PSYCHIATRIC DIAGNOSIS TOTAL 
IN PERCENTAGES 


SERVICH COMMAND 3 
AND 3 a) 
INDUCTION STATION 33 3 2 — 
er 
& g a & s cent Number 
Total 39.9 27.8 2.1 30.2} 100.0 16,101 
First Service Command 57.2 28.1 5.1 146, 1000 1,218 
Boston (Fort Banks), Mass. 57.9 19.4 2.5 20.2 | 100.0 396 
Portland (Fort Preble), Me. 02 — 3.3 6.5 | 100.0 92 
Manchester (Grenier Field), N.H. 76.8 2.4 13.0 7.8 | 100.0 387 
New Haven, Conn. 25.0 57.9 1.1 16.0 | 100.0 276 
Rutland, Vt. 52.9 23.5 1.5 22.1 | 100.0 68 
Springfield, Mass. 40.8 42.9 2.0 14.3] 100.0 49 
Second Service Command 54.9 31.9 2.8 10.4] 100.0 1,886 
Albany-Syracuse, N.Y. 63.0 22.9 0.7 13.4] 100.0 185 
Buffalo-Rochester, N.Y. 33.0 35.3 0.6 31.1 | 100.0 167 
Newark-Camden, N.J. 49.0 32.2 3.6 15.2 | 100.0 261 
New York, N.Y. 58.1 32.3 3.1 65] 1000 1,282 
Third Service Command 18.4 32.6 1.3 47.7] 1000 2,742 
Baltimore, Md. 8.1 8.4 0.7 82.8 | 100.0 611 
Harrisburg, Pa. 17.2 52.0 1.2 29.6] 100.0 3844 
Philadelphia, Pa. 2.7 17.1 7.8 72.4 | 100.0 268 
Pittsburgh, Pa. 29.8 42.5 0.3 27.4| 100.0 1,118 
Richmond, Va. 21.2 646 4.5 19.7 | 100.0 66 
Roanoke, Va. 143 546 1.0 30.1 | 100.0 196 
Wilkes-Barre, Pa. 9.7 2.0 — 88.3 | 100.0 154 
Fourth Service Command 37.3 28.8 1.2 32.7] 100.0 2,108 
Camp Blanding, Fla. 3.0 17.5 — 79.5 | 100.0 187 
Camp Shelby, Miss. 16.9 81.3 — 1.8] 100.0 872 
Fort Benning, Ga. 28.1 683 — 3.6 | 100.0 &7 
Fort Bragg, N.C. 49.8 32.1 13.9 4.2] 100.0 166 
Fort Jackson, S.C. 14.3 32.3 0.2 53.2] 100.0 456 
Fort MacPherson, Ga. 48.6 15.2 1.0 35.2 | 100.0 216 
Fort McClellan, Ala. 2.8 2.3 — 94.9 | 100.0 178 
Fort Oglethorpe, Ga. 73.9 13.5 — 12.6 | 100.0 628 
Fifth Service Command 35.8 14.0 0.8 49.4] 100.0 1,887 
Cleveland, Ohio 26.0 4.6 0.7 68.7 | 100.0 416 
Columbus (Fort Hays), Ohio 32.8 14.2 0.3 52.7 } 100.0 366 
Huntington, W. Va. 34.7 22.0 1.2 42.1 | 100.0 254 
Indianapolis, Ind. 56.8 7.2 0.3 35.7 | 100.0 $61 


Louisville, Ky. 30.7 24.2 1.9 43.2 | 100.0 480 
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TABLE 3 (Continued) 


BERVICE COMMAND 
AND 
INDUCTION STATION 


ath Service Command 
Chicago, Ill. 
Detroit, Mich. 
Milwaukee, Wis. 


Seventh Service Command 


Fort Leavenworth, Kans. 
Fort Logan, Colo. 

Fort Snelling, Minn. 
Jefferson Barracks, Mo. 


Eighth Service Command 


Camp Joseph T. Robinson, Ark. 
Dallas, Tex. 

Fort Bliss, Tex. 

Fort Sam Houston, Tex. 
Houston, Tex. 

New Orleans, La. 

Oklahoma City, Okla. 

Santa Fe, N.M. 


Shreveport (Fort Hamburg), La. 


Ninth Service Command 


Butte, Mont.; Boise, Ida.; 
Spokane, Wash. 

Camp Beale, Calif. 

Fort Douglas, Utah 

Fort Lewis, Wash. 

Fort MacArthur, Calif. 

Los Angeles, Calif. 

Phoenix, Ariz. 

San Francisco, Calif. 

Seattle, Wash.; Portland, Oreg. 


PSYCHIATRIC DIAGNOSIS 
IN PERCENTAGES 


Psychopathic 


22 
ao 


llipell 


3.4 


TOTAL 


Per 

cent Number 
100.0 1,466 
100.0 844 
100.0 918 
100.0 206 
100.0 2,008 
100.0 684 
100.0 198 
100.0 442 
100.0 679 
100.0 872 
100.0 85 
100.0 89 
100.0 22 
100.0 119 
100.0 8&7 
100.0 181 
100.0 196 
100.0 84 
100.0 60 
100.0 1,036 
100.0 68 
100.0 I 
100.0 49 
100.0 2 
100.0 15 
100.0 222 
100.0 46 
100.0 171 
100.0 463 


3 times the proportion of psychiatric rejects of Philadelphia, when 
Detroit had 3 times the proportion of Chicago, New Orleans 3 times 
the proportion of Dallas, and Seattle-Portland 3 times the propor- 
tion of San Francisco, it is difficult to believe that the standards were 
same in all places. Similarly, it is hard to believe that the same 
standards of diagnosis were being used when, among psychiatric 
rejects, New Haven found 3 times as large a proportion of psycho- 
paths as Boston, Pittsburgh found 5 times as large a proportion as 


the 
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Baltimore, or the Camp Shelby, Mississippi, station found 35 times 
as large a proportion as the Fort McClellan, Alabama, station. 

In the absence of a yardstick providing a common nationwide 
standard, it has not been possible in the past to do more than ex- 
press doubts as to the uniformity of diagnoses. However, for the 
more than one hundred thousand men represented in Tables 2 and 
3 we do have a standard against which this variability can be 
judged. This is the NSA. Let us look first at the variability of 
the proportions screened among the different induction stations 
and see how the NSA findings compare with the actual psychiatric 
diagnoses. 


NSA Scores as Compared with Psychiatric Diagnoses 


Table 4 gives us the basic information. The cutting point on the 
NSA scores, plus the additional information from “critical signs,”’ 
is so adjusted that, the nation over, 31.1 per cent of all men were 
screened. By screened we do not mean, as the previous chapter 
should have made clear, that the test diagnosed a given man as a 
psychiatric case. Rather it simply meant that here was a man to 
whom the busy examining physician should give much more than 
average attention. In the group of men screened by the test should 
be found most of the men eventually diagnosed as psychiatric cases, 
and, because of fallibility both of the test and of the diagnoses, there 
would also be found a good many men who would eventually be 
labeled as fit for military service. If, however, the test screened 
twice as many men at one induction station as at another, this 
should indicate a probability that the psychiatric population was 
higher at the former station than at the latter. 

Actually, as Table 4 shows, there was relatively small variability 
from station to station in the percentages screened by the NSA. 
Whereas the psychiatric rejection rate, as we have seen in Table 2, 
ranged from 0.5 per cent to 50.6 per cent, the data in Table 4 show 
that the percentages screened by the NSA ranged only from 21.2 
per cent to 44.5 per cent, and the standard deviation was only 5.1 
per cent. 

Moreover—and this is a very important point—the variability in 
NSA scores was decidedly due more to the differences between 
regions than to differences between stations within the same region.® 


5 The F-test shows that the tendency for homogeneity within service commands as 
compared with the variability between commands is significant at the .01 level. 
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TABLE 4 


Proportion ScREENED BY NSA AMONG PREINDUCTION EXAMINEES, 
Cuassirigp By Disposition BY SERVIcE CoMMAND AND INDUCTION STATION 


(August 1945) 


PERCENTAGES RECEIVING CRITICAL SIGNS 
AND/OR CRITICAL SCORES AMONG: 


REJECTIONS* 
BERVICE COMMAND 
AND 3 
INDUCTION STATION = & 
5 = 
= § 3 
8 4 S 
als 
<j = 
Total 31.1 21.8 |} 30.3; 69.5 80.8 68.2 70.8 56.4 
First Service Command 28.1 14.3 || 25.0| 87.2 742 57.0 66.0 58.6 


Boston (Fort Banks), Mass. 26.8 14.4 || 28.1| 68.6 77.3 53.3 80.0 548 
Portland (Fort Preble), Me. 22.6 13.2 |} 17.9] 58.9 588 a 66.7 50.0 


Manchester 
(Grenier Field), N.H. 44.5 12.6 || 18.8] 73.6 75.4 100.0 61.5 69.0 
New Haven, Conn. 26.2 14.0 || 27.4| 65.7 78.4 60.7 100.0 61.5 
Rutland, Vt. 24.9 16.5 || 17.2] 58.7 72.4 37.6 0.0 53.3 
Springfield, Mass. 21.2 15.4 || 20.5| 59.0 80.0 38.3 100.0 57.1 
Second Service Command 30.4 22.5 || 28.0] 72.0 78.7 68.5 80.3 448 
Albany-Syracuse, N.Y. 27.2 18.2 || 29.5| 74.0 79.1 81.7 100.0 39.1 
Buffalo-Rochester, N.Y. 30.5 24.0 || 30.4] 69.0 77.6 66.2 0.0 63.5 
Newark-Camden, N.J. 27.1 20.8 || 27.2| 70.1 78.7 71.7 88.9 340 
New York, N.Y. 31.9 23.8 || 26.8] 72.4 78.5 67.3 80.0 38.4 
Third Service Command 32.7 22.6 || 31.0] 69.1 78.1 69.5 889 65.0 
Baltimore, Md. 37.4 27.8 |] 29.7 | 74.4 90.0 74.6 75.0 72.7 
Harrisburg, Pa. 35.0 20.5 || 32.8 | 76.6 89.9 71.1 100.0 77.4 
Philadelphia, Pa. 32.1 23.6 |] 33.3 | 83.4 85.7 84.0 100.0 81.4 
Pittsburgh, Pa. 32.5 19.9 || 28.3] 61.0 73.0 64.2 66.7 43.4 
Richmond, Va. 31.6 25.0 || 41.4] 74.1 78.6 80.7 33.3 61.5 
Roanoke, Va. 35.7 24.0 |] 37.2 | 66.3 78.6 79.4 100.0 37.4 
Wilkes-Barre, Pa. 24.0 16.6 || 23.3] 69.7 93.3 66.7 a 66.8 
Fourth Service Command 36.1 26.1 || 37.3] 76.6 86.0 82.1 100.0 59.5 
Camp Blanding, Fla. 36.2 28.6 || 35.8} 79.5 75.0 75.0 a 80.7 
Camp Shelby, Miss. 42.4 25.1 || 41.9| 84.0 93.4 826 a 60.0 
Fort Benning, Ga. 36.4 28.8 |] 39.3} 77.2 93.7 718 a 650.0 
Fort Bragg, N.C. 39.9 28.9 || 45.7) 95.8 92.8 98.1 100.0 100.0 
Fort Jackson, 8.C. 30.0 25.2 || 29.9] 57.5 95.6 85.1 100.0 30.5 
Fort MacPherson, Ga. 34.6 24.7 || 35.5 | 70.4 82.7 63.4 100.0 55.0 
Fort McClellan, Ala. 37.5 26.9 || 38.4 | 93.2 100.0 100.0 a 92.8 
Fort Oglethorpe, Ga. 36.6 23.1 || 33.4| 78.8 83.3 79.7 a 50.6 


EEE ae 


* Administrative rejections have been omitted because of the small number of men in this category. 
* No men in this category. 
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TABLE 4 (Continued) 


SERVICE COMMAND 
AND 
INDUCTION STATION 


Fifth Service Command 


Cleveland, Ohio 

Columbus (Fort Hays), Ohio 
Huntington, W. Va. 
Indianapolis, Ind. 
Louisville, Ky. 


Sizth Service Command 


Chicago, Ill. 
Detroit, Mich. 
Milwaukee, Wis. 


Seventh Service Command 


Fort Leavenworth, Kans. 
Fort Logan, Colo. 

Fort Snelling, Minn. 
Jefferson Barracks, Mo. 


Eighth Service Command 


Camp Joseph T. Robinson, Ark. 
Dallas, Tex. 
Fort Bliss, Tex. 
Fort Sam Houston, Tex. 
Houston, Tex. 
New Orleans, La. 
Oklahoma City, Okla. 
Santa Fe, N.M. 
Shreveport 
(Fort Hamburg), La. 


Ninth Service Command 


Butte, Mont.; Boise, Ida.; 
Spokane, Wash. 

Camp Beale, Calif. 

Fort Douglas, Utah 

Fort Lewis, Wash. 

Fort MacArthur, Calif. 

Los Angeles, Calif. 

Phoenix, Ariz. 

San Francisco, Calif. 


Seattle, Wash.; Portland, Oreg. 
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PERCENTAGES RECEIVING CRITICAL SIGNS 
AND/OR CRITICAL SCORES AMONG: 


24.1 14.6 
31.5 19.2 


33.9 27.1 
32.3 24.3 
33.8 29.7 
33.8 28.8 
26.9 16.7 
36.3 29.9 
34.3 20.2 
37.3 32.4 
33.6 24.2 


33.5 25.6 
32.9 25.0 


32.8 23.5 
30.2 28.8 
28.9 22.5 
22.8 12.9 
25.9 16.0 
32.9 24.1 
29.3 16.8 
35.3 29.2 
33.2 24.8 


i 
a 
3 
3 
3 
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66.0 748 738 a 


100.0 a 1000 a 

86.6 100.0 84.6 100.0 
80.0 87.4 76.6 100.0 
73.6 81.1 63.4 100.0 
75.8 80.5 66.1 0.0 
66.1 87.9 59.4 53.2 
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The largest proportions screened by the NSA were in the South, 
the smallest proportions in the Middle West and New England. 

Now, these two sets of facts, (1) that psychiatric diagnosis varied 
widely from station to station and (2) that NSA scores had much 
less variation,® still may have little significance unless it can be 
shown that, in spite of these overall differences, the men screened 
by the NSA did tend to include a majority of the psychiatric rejects. 

What are the facts? Let us look again at Table 4. The top line 
of this table summarizes the findings for the country asa whole. It 
shows the following proportions screened by the NSA: 


Among all men examined 31.1% 
Among all men accepted for service 21.8 
Among nonpsychiatric rejects 30.3 
Among all psychiatric rejects 69.5 


If we break down the psychiatric rejects further, we find that the 
test was particularly effective in screening men diagnosed as psy- 
choneurotic, less so in screening other classes of psychiatric rejects. 
Proportions screened : 


Among psychoneurotic rejects 80.8% 
Among psychopathic rejects 68.2 
Among psychotic rejects 70.8 


Among all other psychiatric rejects 56.4 


As we see, the NSA, in spite of the variability in psychiatric 
diagnosis, was successful in picking out four out of five of the men 
subsequently diagnosed as psychoneurotic, at a cost of also screen- 
ing 21.8 per cent of the men subsequently passed as fit for military 
service. This is about the same order of discrimination which we 
saw in the earlier results reported in the previous chapter. But 
the test was, as had been feared in advance, not quite so successful 
in spotting the kind of people diagnosed as psychopathic, screening 
only 68.2 per cent. It was even less successful with the ‘‘all other” 
category, which included a great variety of things and which was 
highly varied from one induction station to another.’ 

We must repeat that the test and the clinical diagnosis were not 
necessarily experimentally independent. But these results do show 


¢ The coefficient of variation of psychiatric rejection rates is five times that of propor- 
tions screened by the NSA. 

7 At one station, a representative from the Research Branch found physicians reject- 
ing as “emotionally immature” almost all boys under twenty who were not yet sprout- 
ing a respectable beard. 
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that, whatever the test was measuring, it was measuring verbal 
behavior which tended to discriminate between men accepted for 
service and those rejected on psychiatric grounds. The significance 
of this finding is strengthened by the further finding, shown in 
Table 4, that at every individual induction station with at least 
fifty psychiatric rejects the NSA successfully screened a majority 
of those rejected. Also, at individual stations throughout the 
country, the tendency was for the test to be most effective in de- 
tecting psychoneurotics, somewhat less so in detecting other cate- 
gories. Among 48 induction stations for which complete diagnostic 
comparisons are possible,* psychoneurotics were detected by the 
NSA more successfully than either psychopaths or “others” in 40 
cases, and psychopaths were more frequently screened by the NSA 
than “others” in 31 cases. The completely consistent pattern of 
most success with psychoneurotics, next with psychopaths, and least 
with others is observable in 27 out of the 48 cases. This suggests, 
at least, that in spite of the wide variations from station to station 
in labels pinned on the psychiatric rejects, the particular diagnoses 
were not wholly semantic accidents, depending merely on the per- 
sonal point of view of the local doctors. If exactly the same kind 
of men had been called psychoneurotics at one station and psycho- 
paths at another, we hardly would have observed the uniformity 
with which the NSA screened psychoneurotics more successfully 
than psychopaths. There must have been at least a central core of 
agreement on diagnosis from station to station, as well as, to be sure, 
great divergency.’ 

The variations between induction stations, in proportions of all 
men accepted after medical examination, in psychiatric rejection 
rates, in the proportions of all men screened by the NSA, and in 
the proportions of psychiatric rejects screened by the NSA, may 
be seen readily in Table 5, which puts into summary form much of 
the data we have been discussing from Tables 2, 3, and 4. 

We now have seen (1) that psychiatric rates varied widely, (2) 


* Omitting psychotics because of the small number of cases. 

* To put it another way, if we assume a fairly fixed proportion of psychopaths in each 
population of psychiatric rejects and ascribe deviations from the proportion to mere 
semantical differences, then the larger the proportion classified psychopathic the larger 
the proportion of actual psychoneurotics that would be included among them. Since 
psychoneurotics were more successfully detected by the NSA, it would have to follow 
that the larger the proportion of psychiatric rejections classified as psychopaths, the 
larger the proportion of psychopaths that would be detected by the NSA. This is not 
the case however. In each instance, the proportion of a diagnostic group detected by 
the NSA was independent of the proportion that group was of the psychiatric rejections. 
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that the proportions screened by NSA varied much less widely, 
(3) that for the country as a whole and for individual stations the 
test did discriminate, especially with respect to psychoneurotics. 
Yet the fact is that the small variation in the proportions screened 
by the NSA in hardly any way accounts for the large variation 


TABLE 5 


Summary oF INTERSTATION VARIATIONS IN REJECTION 
Rates AND Proportions ScREENED By NSA 


FREQUENCY OF GIVEN RATE 
BY SERVICE COMMAND 


1st 2nd Srd 4th Sth 6th 7th 8th Gh Total 
Proportion of all 


men 
Under 50.0 1 1 
50.0-53.9 
54.0-57.9 1 
58.0-61.9 2 2 
62.0-65.9 2 
66.0-69.9 1 
Over 69.9 1 
Psychiatric 
rejection rate 
Under 6.0 
6.0- 8.9 
9.0-11.9 
12.0-14.9 1 
15.0-17.9 
18.0-20.9 
Over 20.9 
Proportion of all 
men screened by NSA 
Under 24.0 
24.0-26.9 
27.0-29.9 
30.0-32.9 3 
33.0-35.9 2 
36.0-38.9 1 
Over 38.9 1 
Proportion of Bey 
chiatric rejects 
screened by NSA 
Under 60.0 3 1 1 1 6 
60.0-64.9 
65.0-69.9 2 
70.0-74.9 1 
75.0-79.9 
80.0-84.9 
Over 84.9 
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in psychiatric rejection rates among individual stations. If we 
exclude the two extreme stations in psychiatric rejection rates 
(Manchester, New Hampshire, and Camp Beale, California), the 
correlation coefficient for fifty-three stations between the two vari- 
ables is only .11 = .14. 

To trace further the reasons for the wide differences from one 
service command to another and from one induction station to an- 
other in psychiatric rejection rates would lead us into realms of 
speculation for which there are little supporting data. It should be 
noted, however, that the fact that the two service commands cov- 
ering most of the South were highest in the proportion screened by 
the NSA is quite consistent with the findings in the Army, reported 
in Volume II, Chapter 8, which showed that men of lowest educa- 
tional level had the highest psychiatric rates—both as measured by 
the Research Branch test scores and as demonstrated in actual 
practice in the Army. Since the South has much the lowest edu- 
cational level in the country, it would be expected that it would 
have a somewhat higher proportion of preinductees screened by 
the NSA than other regions in the country. But among the nine 
service commands, the two covering most of the South were not 
the highest in psychiatric rejection rates. The Eighth Service Com- 
mand in the Southwest had a psychiatric rejection rate of 10.5 per 
cent, one of the lowest in the country, while three other service com- 
mands had higher rejection rates than the Fourth, which is in the 
Southeast. In so far as there were regional differences in rejection 
rates, these differences may be more closely related to some prevail- 
ing administrative policy than to any other factor. About all 
that can be said for the variations in diagnosis and disposition 
within Service Commands is that there is a strong likelihood that 
they reflect different standards used by the doctors at different 
stations. 


Further Observations on the Value of Critical Signs to 
Supplement NSA Scores 


Chart I shows, for each numerical score on the NSA, the disposi- 
tion of examinees in August 1945. It will be seen that the percent- 
age rejected as psychiatric dropped fairly regularly, tending to 
flatten out after a score of 22 was reached. 

In Chart I, the data are shown for all men, whether or not they 
showed up as possessing “‘critical signs’ —that is, gave an unfavor- 
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able response to any of the supplementary questions described in 
the previous chapter. 

The efficacy of considering as screened all men with one or more 
critical signs, irrespective of their NSA score, is shown in Chart II. 
On the right is shown the disposition of cases, by NSA score, among 
those who did not receive critical signs. There the proportion of 
psychiatric rejects drops off very sharply among men making high 
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NSA scores. By contrast, look at the left-hand diagram in Chart 
II. This shows the disposition by NSA scores among those with 
one or more critical signs. There is still a relationship between the 
proportion of psychiatric rejects and test scores, but even among 
those with high NSA scores about 30 per cent, on the average, were 
psychiatric rejects. 

Who were the men who had one or. more critical signs, but made 
high NSA scores? We can see in Chart III. Here the psychiatric 
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rejects with one or more critical signs are represented as 100 per 
cent for each NSA score interval. It becomes evident at once that 
as the NSA score increased the proportion of psychiatric rejections 
screened as psychoneurotic went down, while the proportion screened 
as psychopathic went up. The proportion of psychotics and “other 
psychiatric cases” to all psychiatric cases was relatively constant. 
The addition of critical signs to the original test did serve its in- 
tended purpose to some extent, in that they tended to screen a sub- 
stantial number of psychopaths, as well as others not psychoneu- 
rotic, who otherwise would have made acceptable scores. 

Nevertheless, as we have seen, so varying was the psychiatric 
diagnosis from station to station, even the use of signs did not enable 
the test to discriminate what the psychiatrists would do about 
psychopaths and others as well as it discriminated what they would 
do about psychoneurotics. 

For the benefit of the student who wishes to study more in detail 
the effectiveness of the test with or without signs, for various diag- 
noses, Table 6 is presented, summarizing the NSA scores, for all 
types of men found acceptable or rejected. 

It can be determined from the data in Table 6 that the most effi- 
cient cutting point for discriminating psychiatric rejections from 
acceptable men, on the basis of their NSA scores alone, would appear 
to be about 21, which is a little higher than the cutting point of 19 
when the same test was given to Army recruits after three months 
of service. At the cutting point of 21, the percentages screened by 
either sign or score’ were 21.8 per cent among men found acceptable 
for service and 69.5 per cent among men rejected for psychiatric 
reasons. While this represents a rather high level of differentia- 
tion, it does not reach the level of detection to be desired if the test 
were to be used for eliminating individual psychiatric examinations 
in the majority of cases. At this level of performance, alinost a 
third of the men psychiatrists rejected would not even have come 
to their attention. The cutting point of the test could be shifted 
to a point at which the detection of psychiatric rejections would 
reach an acceptable level, say a score of 26 used in conjunction with 
critical signs, at which point 87.2 per cent of the psychiatric rejec- 
tions were screened. But such a shift would make the test not too 
efficient at the other end, requiring the examination of 57.7 per cent 
of the men found to be acceptable. This would have saved the 


Here, as before, the actual cutting point is itself a critical score, so that we screen 
all men possessing a critical sign or having a score of 21 or less. 
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individual examination of only about one third of the men passing 
through the induction stations at this time. While even this result 
would represent some saving of psychiatrists’ time, it fell short of 
the results originally expected. 

Nevertheless, in view of the very brief time available to physi- 
cians for the psychiatric examination, there can be no doubt that 
the test score and critical signs should have served as a useful signal 
that here was a man who needed a thorough going over, while here 
was a man who could be given a cursory inspection. If this prac- 
tice had been carried out in the earlier periods of the war before 
the NSA, when the induction stations were clogged and hundreds 
of thousands of men slipped by only to be caught later in the Army 
or Navy and discharged for psychiatric reasons, the saving in effi- 
ciency to the Army and in money to the American taxpayer might 
have been enormous. Until, however, psychiatric diagnosis is 
better standardized than it is today, it is likely that the predicta- 
bility of psychiatric screening tests will fall far short of ideal. 


CHAPTER 15 


STUDIES OF THE POSTWAR PLANS OF 
SOLDIERS: A PROBLEM IN PREDICTION" 


Introduction 


8 HAS been observed in Chapter 12, the present chapter and 
that following may be regarded as presenting the case history 
of a research project. Data on soldiers’ plans were utilized to make 
predictions of the number of Army veterans who might be expected 
to enter certain fields of endeavor after the war. The predictions 
were not the result of refined statistical techniques. They had no 
basis in previously observed relationships between prediction items 
and a criterion. They involved a number of assumptions which 
could not be checked and a number of contingencies which could 
not be evaluated until after soldiers were discharged in large num- 
bers. Nevertheless, they were widely used and proved on the whole 
to be reasonably accurate forecasts. Two follow-up studies to 
evaluate the extent to which soldiers carried out or attempted to 
carry out the plans they expressed prior to discharge permit an 
evaluation of the procedures used and give certain clues as to the 
conditions under which expressed intentions and plans are predic- 
tive of behavior. 

Relatively early in the war, various groups expressed interest in 
finding out what effect Army service was likely to have on the occu- 
pational plans and intentions of the selectee. Employers were in- 
terested in estimating the number of their former employees likely 
to return to their old jobs; educators were interested in learning 
how many of the men might want to return to school. Many gov- 
ernment departments whose programs included aid to returning 
veterans were seeking accurate information on the demands which 

1 By John A. Clausen. The initial study of soldiers’ postwar plans was developed by 
Lyonel C. Florant and Louis Guttman. The main surveys in this field were directed 
by Clausen. Substantial contributions to the development of the final question- 


naire and its analysis were made by Leta Adler, William McPeak, Shirley Star, and 
Edward Suchman. 
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would materialize with heavy demobilization. This was particu- 
larly true of agencies whose programs tied in with legislation like 
the Servicemen’s Readjustment Act of 1944. The Department of 
Agriculture, for example, was concerned with providing information 
on farming to military personnel interested in buying a farm, to 
insure that such plans might have their basis in a realistic appraisal 
of opportunities. The Department of Commerce and the Smaller 
War Plant Corporation were particularly interested in the plans of 
the men and women in service who might try to start business enter- 
prises. The United States Office of Education sought information 
on the plans of military personnel for school or college attendance 
after their discharge. Selective Service had a need for information 
on the extent to which selectees and volunteers alike intended to 
return to their previous employers. The Veterans Administration, 
charged with administering most legislation affecting veterans, in- 
cluding the educational and loan provisions of the GI Bill of Rights, 
would require estimates of soldiers’ intentions with respect to these 
provisions. The War Department itself was concerned with getting 
some indication of the intentions of its personnel to remain on active 
duty after the war. 

Here, then, was a problem in prediction or, more accurately, here 
was a series of problems in prediction. Primarily, interest was 
focused on answering the question, ‘‘How many?” Alternative 
approaches were possible. One might attempt to estimate the 
proportion of all men in the Army who would follow this or that 
possible course of action, basing such an estimate on a survey of the 
plans and characteristics of a cross section of the Army population. 
Or, one might approach the problem from the other side and esti- 
mate the relative opportunities which might be available to the 
veterans after the war and the probable pressures upon those oppor- 
tunities, attempting to make some quantitative judgments. 

Aside from the practical problem of arriving at an estimate of 
“how many,” there were a number of rather intriguing theoretical 
problems which emerged as the studies progressed. There was 
the problem of assessing statements of intention applied to a vague 
future. Could a soldier’s statement that he expected to go to school 
after being discharged from the Army be taken as evidence that he 
was really likely to go? More generally, under what conditions 
are statements of intention predictive of future behavior? 

Most predictive studies in the social sciences have been based 
largely on the deterministic effects of ‘background characteristics.” 
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How important would such characteristics prove for estimating 
who, as well as how many, would go to school? Where background 
and intentions clashed as predictors, which would be the most 
potent? 

What kinds of questions and of interview or questionnaire design 
would give the most trustworthy results? Could any reliable esti- 
mates be made prior to demobilization? How much effect would 
a change in situation—from “soldier without immediate prospect 
of discharge” to “separatee 24 hours away from veteran status’— 
have on postwar plans? How would a change in the situation 
encountered after discharge affect the actual carrying out of plans? 

As is almost always the case in social science research where a 
large number of uncontrolled variables are involved, the studies 
reported here did not yield unequivocal answers to the questions 
posed above, but they did yield tentative answers to some of them 
and suggested hypotheses for future testing in the case of others. 
The predictions based upon these studies were not uniformly accu- 
rate. Soldiers’ plans, as analyzed and evaluated, served as a basis 
for reasonably good predictions of the number of men who attended 
full-time school, returned to farming after the war or returned to 
their previous employer; but plans for business ownership and 
government jobs proved to be less closely related to actual per- 
formance. 

The following sections of the present chapter describe the general 
problems faced in carrying through the research program centered 
on soldiers’ postwar plans and take up in detail the procedures used 
and the results achieved in attempting to estimate how many vet- 
erans would seek to attend school or college after their military 
service. In Chapter 16 we shall first examine two other instances 
of relatively successful prediction—extent of return to previous 
employer and extent of return to farming—and then consider two 
less successful attempts—predictions of business ownership and of 
governmental employment—seeking to analyze the reasons for the 
relative accuracy or lack of accuracy in these predictions. 


The Problem of Ascertaining Soldiers’ Plans 


Let us first consider the problem of trying to ascertain the plans 
soldiers had for their civilian careers at the time of the pretests for 
the first major survey on this subject in the spring of 1944. Four 
fifths of the men in the Army had been in service for more than a 
year; roughly a third had been in service for more than two years. 
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But the cross-channel invasion was yet to be launched, and the 
prospects for an early return to civilian life by any substantial num- 
ber of soldiers were very dim considering the additional fact that 
Japan still controlled the Philippines and most of the other terri- 
tory she had wrested from the Allies in the first year after Pearl 
Harbor. Given the effect of two years of Army life—two years 
out of contact with civilian jobs or school—and the uncertainty of 
the future, how many men would have plans clearly enough in mind 
to express them in response to a survey? Would not many soldiers 
simply put down the first thing that came into their heads, or 
equally bad, would they not rise to the bait of any suggestions con- 
tained in questions as to their intentions? There was still another 
possibility—that in the time they had for dreaming, during the dull 
routines of Army life, they would conjure up wonderful images of a 
postwar career which they could not in reality hope to achieve. 
These dreams, then, and not reasoned plans, would be caught by 
the survey. 

But even if accurate indications of their plans could be drawn 
from the soldiers, of what value were the intentions of men in such 
a state of uncertainty, so subject to new and changing conditions 
before they could hope to carry out their plans? The Servicemen’s 
Readjustment Act of 1944, popularly called the GI Bill of Rights, 
would be passed in June, but not more than a tiny fraction of the 
men knew the provisions of this bill at the time of the first survey. 
Reemployment, for those who had prior to induction held nontem- 
porary jobs, was guaranteed by the Selective Service Act, provided 
that the job still existed. But no one in Washington could say how 
many servicemen would qualify for reemployment under the provi- 
sions of that act. Some economists were predicting eight million 
unemployed during the demobilization period. Few men in service 
knew what they could expect to find after discharge. 


The Background of Soldiers’ Plans 


At least four fifths of the men who entered the Army after Sep- 
tember 1940 came more or less directly from civilian jobs which 
they had occupied as employees. Roughly a tenth entered the 
Army within a few months after completing or leaving school. A 
smaller proportion had been self-employed in small businesses or in 
agriculture, leaving perhaps 2 or 3 per cent who had experienced a 
considerable period of unemployment prior to their enlistment or 
induction. Among the men who had been employed, many had 
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held jobs for only a year or two, or even less. A considerable pro- 
portion of the men inducted during the latter stages of the war had 
been employed in war industry. 

For some men, Army service represented a disruption of a planned 
career—a disruption of schooling, of apprenticeship, of growth in a 
chosen field of endeavor. For other men, service meant a break in 
a boring routine, a chance to get out of a job into which they had 
perhaps drifted by taking the course of least resistance. The return 
to civilian life might mean either taking up where one left off or 
making a fresh start. For many soldiers, perhaps for a majority, 
the break caused by Army service meant a chance to evaluate 
where they had gotten and to reconsider where they were going. 
Most men had worked prior to entering the service and might be 
expected to work after their discharge, but beyond that, the picture 
was not clear. Some men felt sure they would go back to their old 
jobs; some felt sure they would not; many were between these 
extremes. 

One of the first facts that forced itself into view in the early pre- 
testing of questions relating to soldiers’ postwar plans was that any 
man could be classified as to his interests and plans along a number 
of dimensions. Picture a series of scales each representing interest 
in a specific type of postdemobilization activity under consideration 
—for example, interest in going to school, in going back to work for 
one’s preservice employer, in starting a business, etc. A soldier 
planning to return to work for his preservice employer might rank 
high on a scale of interest in this dimension yet still be considering 
going to school if government aid were available to him, and hence 
rank far from the bottom with respect to interest in full-time school. 
Most men, in fact, even though fairly sure of their plans, were at 
least considermg various alternatives depending on the materializa- 
tion of a number of contingencies. 

This was not the only problem of differing dimensions, however. 
Most men planning to work as employees after discharge were ori- 
ented toward a particular employer or industry, toward a particular 
occupation, or even toward a particular occupation in a particular 
industry. But some men were less concerned with occupation and 
industry than with the location of their employment, the salary at 
which they would work, the independence they would have—or 
security, or the compatibility of the work, or the opportunity for 
advancement. While one might safely categorize a large group of 
soldiers as planning to work for an employer, it was not possible to 
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make any adequate classification of the details of their plans be- 
cause they could not be evaluated in terms of any single dimension. 
Adequate classification could be made only when a dimension was 
clearly specified and the interests of all soldiers in the survey sample 
were evaluated with respect to that dimension. Further, if the 
dimension was one arbitrarily imposed without regard to the sol- 
dier’s frame of reference, there would be less likelihood that the 
classification would be meaningful or predictive of subsequent 
behavior. 

There were, then, a number of serious problems to be overcome 
merely in order to arrive at an adequate classification of the plans 
which soldiers had. Beyond this, there was the crucial question of 
whether or not useful predictions could be based upon a knowledge 
of soldiers’ plans. 


SECTION I 


THE PREDICTION PROBLEM 


The attempt to predict what veterans would do after discharge, 
from a knowledge of their plans expressed prior to discharge, posed 
a type of prediction problem which has not received much attention 
in the literature of social science methodology. 

To the writer’s knowledge, there exists no adequate classification 
of the types of prediction which have been or might be undertaken 
in the social sciences. It is possible to classify prediction studies 
along a number of dimensions relating to methodology, degree of 
specificity, type of activity predicted, etc. One may, for example, 
distinguish between ‘‘absolute”’ predictions (i.e., involving unequiv- 
ocal specification of performance for the individual case or the 
course of group action) and actuarial prediction (involving specifi- 
cation of the probability that individuals of stated characteristics 
will behave in a given way). Again, one may distinguish between 
prediction based at least in part upon individual intentions and the 
individual’s conception of his role—self-prediction—and prediction 
based upon the judgments of others or upon factors in the back- 
ground of the individual ; between predictions based upon statistical 
analysis of the past relationship of various factors to the criterion 
which is to be predicted, and predictions based upon the construc- 
tion of ideal types or coherent behavior systems not involving pro- 
jection from past experiences under similar circumstances. 

For purposes of fitting the present study into a larger frame of 
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reference, it may be useful to classify prediction studies according 
to the specificity and type of activity predicted, as follows: 


A. Predictions for the individual case 


1. Performance—level of attainment, adequacy of adjustment, etc.— 
in a specified role or task (here would be included the bulk of studies 
on prediction of marital adjustment, academic or occupational suc- 
cess, parole behavior, etc.) 

2. Prediction of decisions, choices, preferences in selection, etc., usually 
involving a knowledge of the individual and of the situation in which 
the decision or choice is to be made. Such studies might be sub- 
classified according to whether they relate to: 

a. Unique situations and one-time decisions (e.g., occupational 
choice at time of leaving school or at time of demobilization from 
the Armed Forces) 

b. Extended courses of action or developmental sequences (e.g., 
changing interests and activities with aging) 


B. Predictions for groups of individuals 


. Frequency of occurrence of specified behavioral characteristics in 
given groups (e.g., prediction of the number of juvenile delinquents 
likely to come from a given population segment) 

2. Characteristics associated with choice of or success in a particular 
activity (e.g., prediction of the characteristics of personnel who 
might be secured for the Armed Forces by voluntary enlistment) 

3. Reactions of members of a group to a particular situation or program 
a. Where focus is upon proportions reacting in a given way (e.g., 

prediction of attitude changes resulting from exposure to a pro- 
gram of propaganda) 

b. Where focus is upon the course of mass behavior (e.g., probability 
of race riots in connection with a proposed change in the pattern 
of relations between races) 

4, Patterns of cultural change or end products of social processes (e.g., 
prediction of fashions, of the course of inventions, of the impact of 
technology on other parts of culture, etc.) 

5. Trends in vital indexes, demographic characteristics, etc. 


— 


In so far as individual prediction is based upon an actuarial ap- 
proach, the predictive process would involve type B-1 before pre- 
dictions under A-1 or A-2 could be made. The methods used in 
“screening” personnel, accepting or rejecting individuals above or 
below a certain critical score, based upon a knowledge that groups 
of individuals so categorized have a high expectancy of success or 
failure, may be considered one instance of this type. Where non- 
actuarial approaches are used, the predictive process for the indi- 
vidual case will usually differ significantly from that used in pre- 
dicting for groups. 

The great bulk of methodological studies in prediction for the 
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individual have dealt with problems where the objective was pre- 
diction of performance level or adjustment of the individual (A-1) 
through analysis of the past or prevailing frequency of high or low 
adjustment in various subclasses of the population studied (B-1).? 
The possibility of predicting directly for the individual, as a unique 
personality in a unique socio-psychological field, using intensive case 
history materials, has been stressed by Allport’ and others, but 
relatively few examples of this type of prediction—as prediction— 
are to be found in the literature. 

In the studies under consideration, several types of prediction 
were attempted. As was noted in the introductory paragraphs, the 
initial objective was to estimate how many veterans might be ex- 
pected to make certain choices or follow certain courses at the time 
of leaving the Armed Forces. Thus, the primary classification 
would be type B-1. A second objective was to estimate what kind 
of veterans would enter specified activities. What proportion of 
veterans returning to school would be married? What proportion 
of veterans who wished to farm would have experience in farming? 
What kinds of veterans would try to start businesses of their own, 
and what sort of financial resources would they bring to the enter- 
prises? These were important questions to answer in order to pro- 
vide realistic information and other assistance to veterans. These 
would be predictions of type B-2. 

Most predictive studies in the social sciences have involved gen- 
eralizing from past observations of the relationship between a set of 
predictive items and the criterion to be predicted. But in the pres- 
ent instance no previous experience existed to indicate which items 
were truly predictive. Therefore, the approach had to be in terms 
of an analysis of the dispositions and intentions of individuals as 
affected by military service, or based upon the projection of trends 
in activities pursued before the war by the wartime population of 
the Armed Forces, or upon judgments as to the opportunities which 
might be available to veterans after the war. 

In so far as interest was confined to forecasting how many soldiers 
would go to school, farm, or start businesses, the projection of past 
trends was feasible but not very promising. One might estimate 
school attendance, for example, by applying rates of school attend- 

2The monograph by Horst et al., The Prediction of Personal Adjustment (Social 


Science Research Council, New York, 1941), dealt almost exclusively with this type of 


prediction. 
3 Allport, Gordon, The Use of Personal Documents in Psychological Science (Social 


Science Research Council, New York, 1942), pp. 154-161. 
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ance by age and level of previous education for the civilian popula- 
tion to an assumed age by education distribution of veterans at 
time of discharge. Or one might utilize census data on self-employ- 
ment by age groups to estimate the probable increase of self-em- 
ployment among the Army population had there been no war. But 
such methods must of necessity ignore the fact that the war had in 
all probability modified previous trends and relationships. With 
government aid promised to the veteran who wished to return to 
school, it seemed idle to assume that the number who would return 
to school would merely reflect past trends and would therefore be 
markedly less than the number who left school to enter the service. 

For this reason, the approach to prediction for the whole veteran 
population was made through an attempt to predict for the indi- 
vidual. If one could make a sufficiently intensive analysis of the 
plans and characteristics of individual soldiers, so as to have confi- 
dence that one could make certain qualified predictions for the 
individual, it might be possible to build up estimates for the service 
as a whole. The problem was, therefore, viewed originally as one 
of making the best possible assessment of soldiers’ plans, evaluating 
those plans in the light of their definiteness and realism and in 
terms of the characteristics of the planner, and then of presenting 
these data in sufficient detail and with necessary qualifications so 
that the consumer could consider them along with other data avail- 
able in his specific field of interest. 

In presenting the data on soldiers’ plans, the policy was followed 
of projecting the sample estimates to the total population of male 
personnel in the Army as of the date of the survey. No assump- 
tions were made about men already discharged or men yet to be 
inducted into the service. It was recognized that most people 
interested in such data would like to have estimates for the total 
veteran population, but such a population could not be clearly 
defined until the war was over. 

At the time the first Army-wide estimates of soldiers’ plans were 
prepared in the summer and fall of 1944, over a million men had 
been returned from the Army to civilian life, predominantly by 
reason of physical and mental disabilities. Many more men would 
be discharged before general demobilization would begin. In large 
proportion they would be eligible for the same benefits as veterans 
later to be discharged. But one could not safely assume that men 
previously discharged had the same orientation toward postwar 
jobs and school as did the group surveyed, and one could not assume 
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that men to be discharged by reason of disability would sustain the 
plans they had earlier expressed. Therefore, one could generalize 
survey findings on soldiers’ plans only to the population actually 
represented by the sample surveyed. 

Further, it was stressed that data on plans were only one element 
to be considered in any prediction of what veterans would do after 
the war. If plans were not modified by changing circumstances 
prior to discharge they should at least indicate what veterans would 
try to do, and not necessarily what they would actually do. 

Nevertheless, upon their release, the data on soldiers’ plans were 
frequently quoted and requoted as predictions of the number of vet- 
erans who would attend school, start small businesses, and return to 
farming, without reference to the fact that the survey findings rep- 
resented only a portion of the total population of veterans (i.e., 
only men in Army service at the time of the survey) and without 
reference to the effect of the economic conditions to be encountered 
either on the sustaining of soldiers’ plans or on the success soldiers 
would have in carrying out those plans after discharge. 

If some consumers used the data indiscriminately, others who 
had a need for estimates of what veterans would do dismissed data 
on soldiers’ plans as irrelevant. ‘Soldiers don’t know what they 
want to do,” argued one official, “I’ve talked with dozens of them 
and most of them haven’t the foggiest notion of what they will do 
after they’re discharged.” 

But despite indiscriminate use of the data in some quarters and 
skepticism as to their value in other quarters, the release of a series 
of reports based on the first Army-wide survey of soldiers’ plans was 
followed by many requests for further details and for continuing 
studies to indicate possible shifts in soldiers’ intentions. The plan- 
ning of subsequent studies in this field recognized much more fully 
than had the initial planning that the prediction problem would 
have to be faced squarely in future work. 

As a coordinate part of a world-wide survey of soldiers’ plans 
scheduled for the summer of 1945, it was decided to survey the plans 
of men being discharged to civilian life and then to follow up these 
men several months after discharge in order to learn whether or not 
they had attempted to carry out their plans. Once the relationship 
had been established between plans or intentions expressed while in 
service and actual postseparation performance, it was hoped that 
more accurate forecasts could be made from data on the plans and 
intentions of the total population. 
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Although the agencies which requested data on soldiers’ plans did 
not always know just what kind of data they wanted or how they 
might use the data, it developed in the course of contacts with them 
that by and large they wanted ready-made predictions and not 
merely a set of facts about soldiers’ plans. In general, they wanted 
predictions of the amount of pressure on their programs which 
would come relatively early in demobilization or at least relatively 
soon after heavy demobilization had taken place. How rapidly 
should facilities of the Veterans Administration be expanded to 
meet the demands for educational benefits? How much money 
should be budgeted for the immediate demobilization period to 
cover anticipated needs? What kinds of information about job 
opportunities and problems of self-employment should be made 
available to soldiers in the demobilization process in order to facili- 
tate the smoothest possible transition to civilian life? 

To provide assistance in answering such questions, it seemed de- 
sirable to aim at predictions of the number of soldiers who might 
reasonably be expected to try to enter the activities under consider- 
ation. That is, one would not attempt to predict the number of 
soldiers who would actually enroll in colleges or succeed in starting 
businesses, but rather the number who might be expected to make 
an effort to enroll in college or to get a business into operation. 
Actual enrollment in college would depend in part on the availability 
of space in colleges, on the availability of housing for married veter- 
ans in college towns, etc. Success in starting a small business might 
depend on the availability of materials or of a site and a building. 
For planning purposes, interest was therefore primarily focused on 
demand for facilities. 

In order for data on soldiers’ plans to serve as a basis for accurate 
predictions, the following conditions would have to obtain: 

1. It should be possible to classify a large majority of all soldiers 
into definite categories with respect to their plans. In so far as 
men could not be specifically classified—the men with vague plans 
or alternative plans or no plans at all—it should be possible to spe- 
cify the general employment status which they would be most likely 
to choose and the probability that the undecided group would shift 
in large proportion to any other specific activity under considera- 
tion. It is apparent that if 8 per cent of the men planned to attend 
school but 20 per cent were completely undecided in their plans, an 
estimate of the number likely to attend school could be based upon 
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plans to attend only if one could feel confident that few of the unde- 
cided group would ultimately decide to go to school. 

2. The classification of plans should represent accurately the 
more or less organized attitudes and intentions—the orientation— 
of the individual classified, indicating a tendency to act in a given 
(general) way under a given set of stated or assumed conditions. 
In other words, as of the time of initial study the classification of 
plans should possess both stability or reliability and internal con- 
sistency (given a definition of plans in terms of the organization of 
attitudes toward a given goal). 

3. There should be no marked change in personal characteristics 
or social conditions which would influence the individual substan- 
tially to modify his plans between the time of the survey of plans 
and the time of return to civilian life. In other words, there should 
be no materialization of unexpected contingencies which would 
lead to a change of plans. 

4. Apart from changing conditions, plans should be based upon a 
sufficiently realistic evaluation of the situation and problems to be 
encountered after discharge so that the confronting of that situa- 
tion would not lead to the immediate abandonment of plans. 

It was apparent from the start that not one of these conditions 
would be completely realized. It was known that some men with 
vague plans or no plans at all must be considered potential aspirants 
in almost every field for which estimates were desired—although 
an analysis of the characteristics of the undecided group suggested 
that a large majority would seek some sort of semiskilled or un- 
skilled job with an employer. 

It was known that some of the men classified as having definite 
plans to enter this or that field were really just daydreaming and 
that others were occasionally misclassified. But, the intensive 
pretesting had yielded a fair degree of confidence that our classifica- 
tions were valid classifications of what men actually intended to 
attempt and expected to do. 

It was known that some men were making unrealistic assessments 
of conditions they would encounter or of aids which would be avail- 
able to them—but again it appeared that most men made fairly 
sober assessments. 

Finally, it seemed obvious that some men would modify their 
plans as a result of subsequent experiences while in service, and also 
that the conditions of the job market and the terms of aid available 
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to veterans would almost certainly be in a state of flux between the 
time of our surveys and demobilization. 

It is readily apparent that validation of the data on plans and 
intentions and successful prediction from those data are two quite 
distinct problems. If intentions expressed while in the service may 
be regarded as representing an organization of attitudes toward a 
particular goal, it does not follow that any specific behavior pattern 
is a requirement for validation of the measure of intentions. 

Thus, in relating performance data to plans data, we were not 
testing the validity of questionnaire responses as measures of plans 
and intentions but were testing a more general hypothesis—or 
rather a series of hypotheses—on the relationship of the question- 
naire responses given under one set of circumstances to behavior 
under a different set of circumstances at a later date. It was be- 
lieved that in so far as the conditions earlier specified did not actu- 
ally obtain, some men with definite plans would modify those plans 
or for other reasons fail to attempt to carry them out, and other 
men with only vague plans might well decide to enter any specific 
field under consideration. The problem was to estimate in ad- 
vance of any testing of the relationship just what proportion of the 
soldiers in any category would attempt to carry out their plans— 
and, hence, what cutting points should be used for prediction. 

The effect of various contingencies on the extent to which plans 
would permit prediction of postseparation behavior would clearly 
be expected to differ for different types of activities. For example, 
an increase in the amount of aid available to veterans for education 
would presumably have a greater effect on the prediction of return 
to school than on prediction of any other activity, though all activi- 
ties from which some soldiers switched to school would thereby be 
affected to some degree. 

From interviewing and the use of contingency questions it was 
possible to assess crudely the relative importance that soldiers 
themselves attached to the materialization of certain contingencies, 
but the differential nature of the effects was such that there was no 
way of preparing overall estimates of soldiers’ plans under stated 
combinations of contingencies. 

The predictions that were made, then, from the plans expressed 
in answer to questionnaires, were in large part based on assumptions 
and hunches derived from intimate acquaintance with soldier think- 
ing on the subject of postwar plans and from a much sketchier 
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knowledge of conditions and problems likely to be encountered in 
the demobilization period. 

These hypotheses, speculations, or assumptions were to be 
checked as regards the plans expressed at the time of discharge for 
men discharged in the early phase of demobilization. It was re- 
alized that any relationship established between plans and per- 
formance of this group would not, in all probability, remain in- 
variant for later separatees, but this was to be the starting point 
for evaluating plans as a basis for prediction. Again, it was recog- 
nized that even if separatees’ plans were predictive of their per- 
formance, the same might not be true of plans of men faced with 
the prospect of a year or more of additional service. In order to 
have any confidence in predictions based on plans of such soldiers, 
further evidence would be required that plans expressed in the 
absence of any expectation of discharge would be sustained up to 
and through the separation experience. 

It was not feasible to follow up after discharge a group of soldiers 
who had been surveyed as to their postwar plans before the men 
had any definite expectation of discharge. But data on the relative 
stability of plans were secured, to some degree, by comparing plans 
of separatees with those of soldiers remaining on active duty, hold- 
ing constant certain crucial characteristics of the two groups. 
Thereupon the separatees were followed up, after two to four months 
of civilian life, to find out how many attempted to carry out the 
plans they had described at separation. The technique used for 
the follow-up is briefly described at the close of the following section 
which deals with the general procedures followed in studying sol- 
diers’ plans. 


SECTION II 


APPROACHES TO ASCERTAINING SOLDIERS’ PLANS 


The interview. Interviewing soldiers with reference to the prob- 
lem area under consideration before attempting to formulate a spe- 
cific set of questionnaire items was a procedure regularly followed 
in major surveys, especially in studies which tapped areas not al- 
ready explored in detail by previous surveys. In those instances 
where the objective of a study was to ascertain attitudes toward a 
particular problem, situation, or group, the interview might be 
fairly narrow in scope and the relevant data might be obtained by 
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a small number of questions. In questioning soldiers as to their 
postwar plans, it was found that a few general questions would give 
a fairly good indication of the general trend of the individual’s 
thinking—whether he planned to return to his previous job, seek a 
new job, start a business enterprise, or attend school—but that a 
check on the degree to which plans were thought out and were re- 
alistic for the individual planner required a good deal of probing 
into the details of those plans. This finding suggested that the 
interview was perhaps the only suitable method for arriving at an 
accurate classification of soldiers’ plans. In an interview it was 
possible to “‘feel out’’ the soldier, prodding to find out how well 
crystallized and how realistic his plans seemed, suggesting possible 
problems which might prevent the achievement of planned goals 
and getting the reactions of the respondent to those problems. 
Individual A, the soldier who stated categorically that he would 
have a restaurant of his own after the war, might hesitate and 
backtrack when asked about the cost of equipping a kitchen or the 
problem of locating a suitable site and getting into operation with a 
minimum of capital while supporting a family. These questions 
had to be asked in order to evaluate the plans of Individual A, but 
they would be completely irrelevant to Individual B who planned 
to return to school. And in an interview, only the relevant ques- 
tions need be asked, i.e., those questions relevant to filling in some 
of the details of the specific plan or plans of the respondent and 
checking his reactions to alternative courses which he might rea- 
sonably be expected to follow. 

This gave the interview a tremendous advantage over the ques- 
tionnaire, which could not hope to present the relevant questions 
for each individual soldier and could not prod and probe in the 
same sense as an interviewer could. But interviews are relatively 
costly, since they require a great expenditure of time by trained 
personnel, and any adequate classification and analysis of soldiers’ 
plans would require a carefully selected sample of many thousands 
of cases. The need for a large sample is readily apparent if one 
considers the task of classifying and analyzing plans to enter any 
single area of interest to the study. Only a small proportion of the 
men in service had definite intentions to start a business, for exam- 
ple, yet it was desired not merely to estimate the magnitude of this 
proportion but also to establish what kind of men had such plans, 
what specific businesses they planned to enter, what experience 
they had had in such businesses, and what arrangements they ex- 
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pected to make in order to finance these enterprises. This could 
be accomplished only by taking a total sample sufficiently large to 
yield several hundred cases in each of the major areas of interest. 

The small field staff of the Research Branch, relative to the 
volume of research demands, precluded any possibility of doing an 
adequate study of soldiers’ plans by any method other than group 
administration of questionnaires. Thus the problem became one 
of devising a questionnaire which would both give an adequate 
classification of soldiers’ plans and intentions, by broad areas, and 
provide sufficient detail to evaluate the plans and make the data 
useful for planning purposes. 

The questionnaire. Like most questionnaires, the final form used 
for the study of soldiers’ plans was not arrived at by a single effort 
but was the result of considerable testing and revision. In fact, in 
the amount of testing and reformulation, this questionnaire was 
perhaps exceptional. A brief description of the steps in the devel- 
opment of the final form of the questionnaire will indicate the rea- 
sons for this. 

The development of the questionnaire through pretesting. The first 
questionnaire pretest in the general area of postwar plans was pri- 
marily an attempt to secure a set of questions which would assess 
the amount and conditions of interest in remaining in the Army 
after the end of the war. It also tested a selection of items from a 
previously developed battery designed to rank soldiers according 
to their interest in postwar education, and contained several ques- 
tions relating to general job plans. 

A battery of questions which permitted a rough ranking of re- 
spondents on interest in postwar Army service was developed, and 
the items relating to postwar education again provided a reasonably 
good ranking with respect to this area. It was found, however, 
that a number of individuals who were completely consistent in 
saying that they would attend school, in answer to a series of ques- 
tions relating only to school (or were likewise consistent in express- 
ing plans for an Army career), said they would do something else 
when presented later in the questionnaire with a general question 
permitting the selection of one of a series of alternatives. These 
alternatives included plans to do the same kind of work as the sol- 
dier had previously done, plans to do a different kind of work, plans 
for school, and plans to remain in the Army. 

This seemed to indicate either a lack of any stable plans or a tend- 
ency of some men merely to respond to suggestion when presented 
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with a direct question or series of questions relating to plans with 
respect to a single course of action. It appeared also that question 
order was extremely important in determining responses to single 
items or to whole scale batteries. What would happen if the ques- 
tion involving alternatives were to precede the questions relating 
to school or to an Army career, or if question batteries were broken 
up and distributed throughout the questionnaire? 

A second pretest was designed to test the effects of context on 
response, using four questionnaire forms which were administered 
interleaved to approximately two thousand men. Results were 
not conclusive, in the sense of showing uniform shifts when direct 
items or scales were preceded or followed by general questions pre- 
senting or implicitly containing alternatives, but a number of 
marked shifts were observed. In general, however, a direct ques- 
tion relating to plans for a single course of action was found to draw 
more affirmative responses for that course of action than a question 
which contained that course as one alternative, regardless of order 
of presentation. 

The second pretest also contained a number of questions relating 
to interest in farming, in self-employment, and in migration. Rela- 
tively high consistency was found in the expression of plans for 
farming, which came predominantly from men with a farm back- 
ground. Surprisingly high aspirations for owning a business were 
found, though in large proportion the aspirants indicated other plans 
when presented with alternatives. Expression of plans for migra- 
tion seemed likewise not wholly consistent either within the set of 
items relating to migration or in comparison with occupational 
plans. 

A final pretest was set up to sharpen the items used and to achieve 
an item order which would minimize inconsistency and effects of 
suggestion. Questions on plans were preceded by a set of questions 
on preinduction employment experience and were introduced by a 
write-in question relating to general job plans. This form was on 
the whole found satisfactory and, with some modifications of indi- 
vidual items, was administered to more than twenty-three thousand 
officers and enlisted men in the continental United States and over- 
seas in the summer of 1944. 

The final questionnaire used in the summer of 1944 was designed 
not merely to classify individuals according to their relative interest 
in following a particular course of action after the termination of 
hostilities, but to analyze the details of their plans. It was aimed 
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at securing reasoned evaluation of plans—recognizing the limitations 
of the questionnaire for achieving this purpose—rather than reflec- 
tions of aspirations. It sought to minimize the effects of suggestion 
by first getting the respondent to write out his plans in answer to a 
general question which contained no suggestive elements. 

The adequacy of a questionnaire to fulfill the requirements for 
which it is designed is not merely a matter of asking the right ques- 
tions. An integral part of the problem of survey design is the plan 
for analysis of the data secured. In the case of the study of postwar 
plans it was found that differing approaches to analysis could yield 
markedly different estimates from the data. The analysis plan for 
the first major study of postwar plans was in many ways awkward. 
The codes set up for classifying data contained in the write-in ques- 
tions were exceedingly complicated because of the number of dimen- 
sions involved. Therefore, even before the results of the July 1944 
survey had been fully reported, additional pretesting was begun to 
try to simplify the analysis plan and to secure additional detail on 
plans for business ownership and school attendance. Further ex- 
periments were conducted on the effects of context, and an attempt 
was made to arrive at a sequence of items which would minimize 
suggestion without requiring use of a general write-in question. It 
was found, however, that the use of the general write-in question as 
a lead-off item and the general sequence of the 1944 version gave 
results which seemed less affected by suggestion than did any of the 
alternatives tried. The final questionnaire used in the summer of 
1945 was, therefore, basically very similar to the form used a year 
earlier, though expanded in some respects (plans to work for an 
employer, plans to own a business, plans for school) and contracted 
in others (plans for further Army service, plans to migrate). Per- 
haps the most important change was the addition of a question 
toward the close of the questionnaire asking for a reconsideration 
and final evaluation of the general alternatives of employment 
status. 

The preceding description of the steps in the development of the 
two forms of the questionnaire which resulted in estimates of sol- 
diers’ plans has merely touched upon some of the general problems 
encountered. It has been suggested that the adequacy of the ques- 
tionnaire itself can be judged only in terms of the plan of analysis. 
The plan of analysis, however, was based in large part on assump- 
tions with respect to the specific fields of interest covered by the 
questionnaire. For this reason, further analysis of the problem of 
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ascertaining soldiers’ plans will be directed to several of these fields 
of interest—particularly to the problems of ascertaining plans for 
school attendance, business ownership, and returning to previous 
employer. In order to avoid undue repetition at a later point, the 
use of the data for prediction purposes and the accuracy of the pre- 
dictions made will be discussed along with the technique of ascer- 
taining plans for each area of interest. Before taking up these 
detailed analyses, however, a brief note on the follow-up technique 
is in order. 

The problem of ascertaining performance. The general hypothesis 
to be tested by following up separatees was that men who expressed 
plans to follow a given course of action after their discharge— 
whether school attendance, returning to previous employer, self- 
employment, etc.—would actually attempt to follow that course 
of action. Therefore, the criterion to be evaluated was not the 
activity in which the veteran was involved at any particular time 
after discharge, but his efforts to enter a given field within a few 
months after leaving the service. 

The method projected for the follow-up was to mail question- 
naires to a sample of separatees from four widely scattered separa- 
tion centers asking about their occupational activities, interests, 
and efforts. This questionnaire, again, had to be simple enough so 
that men with only a grade school education could answer it, yet 
detailed enough to permit evaluation of the criterion. A crucial 
problem, of course, was the definition of an “‘attempt’’ to carry out 
previously expressed plans. If a veteran actually succeeded in 
realizing his plan, or was actively engaged in pursuing it at the time 
of the follow-up, he could clearly be classified as having attempted 
to follow this particular course of action. If, however, he merely 
inquired about entering the field he had planned to enter, received 
a discouraging response to his inquiry, and gave up his former plans, 
it seemed doubtful that he should be classified as having made an 
attempt to realize the plans. For each of the fields with respect 
to which plans were evaluated, explicit criteria were established. 
These will be discussed in the sections relating to the several areas 
of interest. 

The mail follow-up of men discharged in July 1945 yielded re- 
turns from 88 per cent of the group surveyed. In order to check on 
possible response bias, interviews were conducted with the bulk of 
the nonrespondents from two of the four separation centers. No 
response bias was apparent. 

A second follow-up study was conducted among men separated 
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at the peak of demobilization (December 1945) in order to check 
on the effect of a markedly changed economic situation on the pre- 
viously observed relationships between plans and performance. 
Again a response rate of 88 per cent was secured by mail question- 
naire alone. Although a slight educational bias was observed, the 
questionnaires returned are believed to be sufficiently representative 
of the total group surveyed to provide acceptable evidence.‘ 


SECTION III 
POSTWAR EDUCATIONAL PLANS OF SOLDIERS 


Somewhat less than a tenth of the Army’s enlisted men and 
about a fifth of its officers were full-time students just before they 
entered the service. Some of these men had just completed high 
school or college when they enlisted or were inducted. Others cut 
short their education to go to war. Still others had left schools and 
colleges to take jobs in war industry. Cross-sectional surveys indi- 
cated that about 13 to 15 per cent of the total Army population at 
the time of peak strength had been out of school for less than one 
year when they entered service. Had there been no war, it is likely 
that the number of these men in school would have declined sharply 
with each year of aging, as some finished high school, some finished 
college, and some simply dropped out of school. It is virtually 
certain that not more than 5 per cent of the men demobilized from 
the Army during 1945-1946 would still have been in school at that 
time had there been no war. 

On the other hand, it is probable that many of the Army’s officers 
and enlisted men who had graduated from high school and then 
taken jobs would have liked to attend college if financial obstacles 
had not seemed too difficult to surmount. Given financial aid, 
they were potential students again. Other men, who had dropped 
out of school because they were not sold on the value of further 
education, saw in the Army the enormous advantages secured by 
their better educated fellow soldiers. Given another chance, they 
might choose to better their competitive position by seeking further 
schooling. 

Because of the possible effects of government aid for schooling 
and the effects of Army service itself on an interest in further edu- 
cation, the plans of soldiers to return to full-time school were sought 
as an indication of the probable rate of return. 


*The techniques used in the mail follow-ups have been described in an article by 
John A. Clausen and Robert N. Ford, ‘Controlling Bias in Mail Questionnaires,” 
Journal of the American Statistical Association, Vol. 42 (December 1947), pp. 497-511. 
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Ascertaining Plans for School 


The initial survey relating to plans for school was conducted 
nearly a year before the Servicemen’s Readjustment Act of 1944 
provided for aid to veterans wishing to return to school. The 
function of the survey was to provide an estimate of the proportion 
of soldiers who might be expected to return to school even without 
a program of federal aid, and an estimate of the proportion who 
might go if such aid were available. To this end a set of questions 
was developed relating to interest in returning to full-time school 
under various contingencies. These questions were carried toward 
the close of a questionnaire dealing primarily with problems of 
morale and adjustment. The sequence of questions on interest in 
postwar education followed immediately after a question on gen- 
eral occupational interest. The items with which we are chiefly 
concerned are given below, along with the percentage distribution 
of respondents to each category. (Question numbers are those in 
the original survey.) 


86. If you could do what you wanted, would you like to go back to full- 
time school or college after the war? (Check one) 


Yes 35% 
No 48 
Undecided 15 
No answer 2 
100% 


87. Regardless of what you want to do, do you think you will actually go 
back to full-time school or college after the war? (Check one) 


Yes 8% 
No 77 
Undecided 12 
No answer 3 
100% 


88. Do you think you would go back to full-time school or college after the 
war if the Federal government or your state government were willing 
to help pay your way? (Check one) 

I expect to go back to school whether or not the Federal or 

state government helps me 8% 
I will go back if the Federal or state government helps me 23 
I will not go back, even if the Federal or state government 


would help me 38 
Undecided 26 
No answer 5 


90. 


91. 


92. 


93. 
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If you were offered a good job at the end of the war, do you think you 
would turn it down and go back to full-time school or college, or do 
you think you would take the job? (Check one) 
I would turn the job down and go back to school 6% 
If I could get some help, I would turn the job down and go 
back to school 


I would take the job and not go back to school 69 

Undecided 11 

No answer 5 
100% 


If you could get some kind of job after the war, but could not get a good 
job, do you think you would turn it down and go back to full-time 
school or college, or do you think you would take the job? (Check 


one) 
I would turn the job down and go back to school 20% 
If I could get some help, I would turn the job down and go 
back to school 18 
I would take the job and not go back to school 42 
Undecided 14 
No answer 6 


100% 


If you couldn’t get any job at all after the war, do you think you 
would go back to full-time school or college? (Check one) 


I would go back to school 21% 

If I could get some help, I would go back to school 27 

I would not go back to school 30 

Undecided 17 

No answer 5 
100% 


If the war lasts longer than you expect, will that make any difference 
in whether you go back to full-time school or college after the war? 
(Check one) 
I don’t think I will go back to full-time school or college 40% 
I will go back to full-time school or college, even if the war 


lasts longer than I expect 10 

I will probably not go back to full-time school or college 27 

Undecided 17 

No answer 6 
100% 


Tf you could do what you wanted, which would you rather do after the 
war—go to full-time school or college, or go to part-time school or 
college? (Check one) 

I would not like to go to school or college at allafterthe war 35% 


I would like to go to full-time school 18 
I would like to go to part-time school 29 
Undecided 12 
No answer 6 


100% 
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95. Regardless of what you want to do, do you think you will actually go 
to full-time school or college, or go to part-time school or college 
after the war? (Check one) 

I don’t think I will go to school or college at allafterthe war 52% 
8 


I think I will go to full-time school 

I think I will go to part-time school 19 

Undecided 16 

No answer 5 
100% 


A careful scrutiny of the “‘marginals’’ reported for each question 
indicates considerable variation of the proportion interested in full- 
time school under varying contingency specifications. While 35 
per cent of the soldiers said they would like to go to full-time school 
or college after the war in answer to the first question in the se- 
quence, only 8 per cent said they thought they would actually go 
in answer to a direct question which mentioned no contingencies. 
Under the most favorable contingency from the point of view of 
school attendance—government aid available for education but no 
jobs available—48 per cent of the men said they would go to full- 
time school (Q 92). Thirty-eight per cent said that with aid avail- 
able they would go to full-time school, if they were able to get some 
kind of job but not a good job, while 15 per cent said they would 
turn down a good job to go back with government aid. In consid- 
ering each of these job contingencies, we find that roughly half of 
those who said they would go to full-time school indicated that they 
would do so only if government aid were available. Mention of 
the single contingency of government aid, with no mention of the 
job market (Q 88) led 31 per cent of the sample to say they would 
return if such aid were available. 

The proportions of men who said they would go back without 
aid, and the proportions who said they would go back if aid were 
available are summarized for various contingencies in Table 1. 

When the job and government aid contingencies were dropped, 
10 per cent of the men reported that they would go back “‘even if 
the war lasts longer than I expect”’ (Q 93), and without mention 
of contingencies, 8 per cent again said they would actually go back 
to full-time school (Q 95). A consideration of these two figures 
gives a clue to the interpretation of responses to contingency ques- 
tions. Interviews with enlisted men had established the fact that 
many men who hoped to return to full-time school were worried 
that they would be too old to return if the war were to last very 
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long. Yet responses to the unfavorable contingency actually 
showed a higher proportion reporting plans for full-time school 
than when no contingency was presented. However, the unfavor- 
able war-duration contingency followed the most favorable of all 
situations presented—government aid for education and no jobs 
available—and it appears that in answering this question on the 
effect of a long war, some men carried in their minds the previously 
presented situation. That is, they were considering all three fac- 
tors—aid, unfavorable economic situation, and duration of war— 
in answering the question which was intended merely to present 


TABLE 1 


Puans FoR Foui-rime ScHoot Expressep 1N ANSWER TO QUESTIONS 
PresenTING VaRgious CONTINGENCIES RELATING TO THE 
AVAILABILITY OF JOBS AND OF GOVERNMENT AID 


Question Say they would Say they would 
number Job contingency go without aid go with aid* 
88 None mentioned 8% 31% 
90 Good job available 6 15 
91 Some kind of job available 20 38 
92 No job available 21 48 


* Includes those who would go without aid. 


the third of these. By the time they got to the final question, ask- 
ing for a self-prediction without specification of contingencies, it 
would appear that they were making a more general evaluation of 
the situation. 

It must be realized that whether or not contingencies were men- 
tioned in the questions asked, the plans expressed by the soldiers 
were to a considerable degree contingent plans. There is evidence 
that, by and large, responses to contingency questions do not rep- 
resent careful consideration of the specific contingencies presented, 
but merely reflect general reaction to a more or less favorable situa~- 
tion than that assumed (in a given context) if no conditions are 
mentioned. 

The data previously presented on plans for school attendance, 
with government aid and without such aid under various job con- 
tingencies (Table 1), show marked shifts in the differential effects 
of aid under various circumstances. It is apparent that many men 
must have found it difficult to evaluate the various contingencies 
in a consistent fashion. This is not surprising, for “a good job” 
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and “‘some kind of job [that is] not a good job” are concepts not 
easily evaluated, and the same is true of the availability of an un- 
specified amount of government aid to weigh against job contingen- 
cies. It appeared, then, that the condition of the job market and 
the availability of government aid might have great influence on 
the extent to which veterans would return to school, but it seemed 
doubtful that soldiers’ reports of their plans under various contin- 
gencies could be accepted at their face value. 

In addition to the problem of interpreting the answers to contin- 
gency questions there were a number of inconsistencies which were 


TABLE 2 


INTERRELATIONSHIP OF REsPONSES TO Two ALMosT IDENTICAL 
Questions on EpucaTiIonaL PLans 


(Question 87 and Question 95) 


(Per cent) 
95. Do you think you will act- 
ually go to full-time school 87. Do you think you will actually go back 
or to part-time school? to full-time school? 
RESPONSE CATEGORY CHECKED RESPONSE CATEGORY CHECKED 
Undecided or 
Total Yes No no answer 
Total 100.0 8.2 76.8 16.0 
Full-time school 8.3 4.9 2.0 1.4 
Part-time school 19.3 2.3 13.4 3.6 
Not planning to go to school 51.4 0.4 48.3 2.7 
Undecided or No answer 21.0 0.6 13.1 7.3 


difficult to explain. For example, more than a third of the men who 
said they would turn down a good job and go to school even without 
government aid failed to predict that they would go when asked 
what they thought they would actually do. Perhaps the most 
startling evidence of inconsistency came from almost identical 
questions asking what the soldier thought he would actually do, 
separated in the questionnaire by seven items dealing with various 
contingencies. The cross tabulation of these two questions is 
given in Table 2. 

It will be noted that the two questions give almost identical esti- 
mates of the number of soldiers planning to attend full-time school 
—8.3 per cent and 8.2 per cent—but that only 4.9 per cent are con- 
sistent in expressing such plans to both questions. It appears that 
some men who answered question 87 affirmatively were really inter- 
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ested in part-time school, but, given no opportunity to express 
interest in part-time schooling, they reported plans for full-time 
school in order to avoid seeming negative about any further educa- 
tion. But it is less apparent why 3.4 per cent of the men were 
negative or undecided in answering question 87, yet said they ex- 
pected to go to full-time school in answer to question 95. Analysis 
of the relation between responses to those two questions and re- 
sponses to the intervening questions reveals two possible explana- 
tions, however. First, there were a nuniber of men with only grade 
school education who checked in answer to one or the other of these 
questions that they expected to attend full-time school although 
their response to the initial question in the sequence (Q 86) indicated 
that they had little interest in or desire for further schooling. These 
men with little schooling were apparently quite suggestible when 
asked a whole series of questions about school attendance. Second, 
there were a number of men who expressed themselves as undecided 
or said they did not expect to attend school in answer to question 
87, but who said that they would go if government aid were avail- 
able (Q 88). Having considered the possibility of such aid, these 
men may have been led thereupon to say they would actually go 
back, when given another chance to predict their future. 

In view of these inconsistencies, how could the data best be uti- 
lized to give some estimate of the proportion who seemed likely to 
return to school after the war? Should responses to some one item 
be taken as the best indication of probable return to school, should 
complete consistency be required on all items, or could some other 
criterion be applied? 


School Plans as a Scale Continuum 


Responses to the battery of questions on educational plans were 
analyzed by the technique of scale analysis described elsewhere in 
this volume. The items were found to yield a fairly good approxi- 
mation to a scale (according to the less rigorous criteria for scalo- 
gram analysis used in the early days of the Research Branch) after 
certain combinations of response categories had been made. From 
the data already presented on inconsistencies among responses, it 
will be obvious that the amount of scale error for responses to cer- 
tain items was considerable. 

An examination of the cross tabulation of questions 87 and 93 by 
educational groups (Table 3) is illuminating. To minimize incon- 
sistencies resulting from errors in checking, the relationship be- 
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tween responses to the two questions is presented only for those 
men who had earlier checked that they ‘‘would like to go back to 
full-time school or college after the war.’’ It will be noted that, 
relative to the proportion of each educational group who expect to 
return to full-time school, the contingency of a longer war than 
expected dissuaded a higher proportion of the men whose education 
stopped short of high school graduation than of those of higher 
educational status. More strikingly, however, among both of the 
lower educational groups the seemingly negative contingency actu- 


TABLE 3 


Errzcr or LenetH-OF-Wark CONTINGENCY ON Expressep PLANS 
or Souprers Wo Say Tuey Wovutp Like to Rerurn 
To Foui-Tiwm Scxoou, sy EpucationaL Leven 


(Per cent) 


EDUCATIONAL LEVEL OF RESPONDENT 


Less than 
COMBINED RESPONSES high school High school Some 
TO QUESTIONS 87 AND 93 graduation graduation college 
Total who would like to return to full-time 
school 100 100 100 
Q87: Think they will actually return 16 17 41 
Q93: Say they will return even if war 
lasts longer than expected (7) (10) (28) 
Q93: Say they will not return if war lasts 
longer than expected (9) (7) (13) 
Q87: Think they will actually not return 84 83 59 
Q93: Say they will return even if war 
lasts longer than expected (10) (11) (3) 
Q93: Say they will not return if war lasts 
longer than expected (74) (72) (56) 


ally drew a higher proportion of affirmative responses from men 
who had said they did not expect to return to school than it dis- 
suaded from those who did plan to return. Among college men, on 
the other hand, the contingency was clearly evaluated as negative— 
it dissuaded 13 of the 41 per cent who had reported the expectation 
of returning to school, while it drew a positive response from only 
3 of the 59 per cent who had not reported the expectation of return- 
ing. 

One may hypothesize either that the less well-educated men were 
less consistent in their responses (perhaps because of suggestibility, 
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perhaps because of not having clearly formulated plans) or that the 
contingency had a different meaning to the college men than to 
those of less previous schooling. The soldier who needed only a 
year or two more of college to take his degree might be less discour- 
aged from continuing on after discharge by having to spend a longer 
time in service than the soldier who had not yet begun college. 

Even though the questions presenting various contingencies which 
might affect return to school after the war tapped more than a single 
dimension and thus did not yield a true scale, the technique of scale 
analysis did help to give a clearer picture of the patterns of relation- 
ship among response items than could be readily obtained by simple 
cross tabulation of questions. The combination of categories was 
facilitated by studying these patterns of interrelationship in the 
light of manifest content and the findings of intensive interviews. 

By treating the questions and responses by the method of scale 
analysis it was possible to get a somewhat clearer picture of the pat- 
terns of response and the combinations of categories which seemed 
most reasonable in the light of their interrelationships. Moreover, 
by scoring individuals to the nearest scale type, it was possible to 
classify individuals who omitted one or two of the questions and to 
classify some of those who were inconsistent on only one or two 
items. That is, these individuals were classified as belonging to 
the scale type to which they could be scored with the least number 
of scale errors. Thus it was possible to rank individuals in a small 
number of groups, ranging from those who were consistently posi- 
tive in expressing plans for school to those who had no interest in 
further full-time schooling, although rankings were crude and some- 
times equivocal. But, given such a ranking of individuals based on 
their plans for school, how did one go about establishing a cutting 
point such that one would predict school attendance after discharge 
for men above the cutting point and predict nonattendance for men 
below the cutting point? Scale theory did not answer this ques- 
tion, and the cutting point actually used was based on considera- 
tions of consistency and manifest content of the items. 


The Assessment of Plans for Predictive Purposes 


It will be recalled that 8 per cent of the men surveyed in the sum- 
mer of 1943 said they thought they actually would return to school. 
Nearly 7 of these 8 per cent were consistent in saying that they 
would like to go to full-time school, would go to school even without 
government aid, and would go back if offered any job but one they 
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considered a good job. Only about 4 of the 7 per cent, however, 
said they would return to school “even if the war lasts longer than 
I expect,”’ and the same proportion (though not the same individu- 
als) said that even if offered a good job they would turn it down and 
go to school without government aid. Only about 3 per cent said 
they would go under all contingencies presented. 

It was felt that the 7 per cent who said they actually expected to 
return to school and were consistent in responding affirmatively to 
all of the questions except those dealing with the contingency of a 
good job and of a long war had rather coherent attitudes toward 
going back. This figure, therefore, was taken as an upper bound 
to the per cent who would go back if no aid were provided. But 
the analysis also considered the characteristics of the men who re- 
ported plans to attend school. It was felt that men who had been 
out of school more than a year before entering the Army, men who 
were married, and those over twenty-five years of age, would en- 
counter great difficulty in attempting to go back to school, and 
therefore, even though their determination to attend might be 
great, relatively few could be expected to return. These men com- 
prised nearly half of the 7 per cent who consistently expressed plans 
to return to school. Therefore, a lower bound of the estimate of 
school attendance by soldiers was set at 3.5 per cent, providing aid 
were not available to the soldier interested in returning to school. 
Because soldiers’ thinking about the various contingencies of aid 
and the conditions of the labor market seemed rather vague, no 
estimate of probable enrollment under any particular set of contin- 
gencies was attempted. It was pointed out, however, that many 
of the men who said they might go to school if aid were available 
did not seem likely prospects for enrollment because they were over 
twenty-five, were married, or had been out of school for a number 
of years before they entered the service. 

The first survey in the area of educational plans, then, indicated 
that responses to contingency questions could not be taken at face 
value, showed a large element of uncertainty and inconsistency in 
soldiers’ plans for full-time education, and posed the problem of 
setting the cutting point for prediction when soldiers were ranked 
according to their degree of interest in full-time education. 

Subsequent pretests revealed that a question which confronted 
the soldier with a choice of alternative courses of action—private 
employment, governmental employment, self-employment, school 
attendance, or continued Army service—gave a somewhat lower 
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estimate of the proportion with plans to attend school than did the 
cross tabulation of those questions relating solely to plans for school 
attendance which had previously been used as a basis for plans 
classification. 

A pretest carried out among four hundred white enlisted men at 
a single post found that of 44 individuals who had checked that 
they expected actually to attend full-time school in answer to the 
direct question on educational plans, only 24 also checked that they 
planned to attend school in answer to the question presenting alter- 
natives. This suggested that reports of educational plans when 
the question context was solely educational interests were not com- 
parable with reports when the context included alternative areas of 
interest. Men who were considering the possibility of attending 
full-time school after the war might say they actually expected to 
go to school when asked only about educational plans, but on con- 
sidering the alternative of going to school or taking a job might 
decide that they would probably take the job. 

Among men who checked that they planned to attend school in 
answer to the question involving alternatives, the vast majority 
also indicated that they would like to attend full-time school, that 
they planned to attend if government aid were available, and that 
they thought they actually would attend. Thus, this response 
could be used to obtain greater discrimination at the top of the 
crude scale ranking individuals according to their school plans. It 
served to categorize a group of soldiers who were not only highly 
interested in returning to school, but who were leaning more strongly 
in this direction than in any other direction with respect to their 
postwar plans and intentions. 

When the final version of the survey was put into the field in June 
1944, the following four questions were used to classify soldiers 
with respect to their plans for full-time school (marginals shown 
are based on responses of 19,556 white enlisted men surveyed in the 
United States and overseas, without correction for a minor sam- 
pling bias in level of education of respondents) : 


32. Do you expect to work for yourself or for someone else 
right after you leave the Army? (Check one) 


I expect to work for myself 2A% 
I expect te work for the government 

(Federal, state, city, etc.) 9 
I expect to work for a relative 


(father, uncle, etc.) 5 


698 


These four questions again permitted a rough ranking, using scale 


42. 


43. 
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I expect to work for some other employer 
(company, person, etc.) 

I plan to go to full-time school 

I don’t have any definite plans 

No answer (including double checks) 


Do you feel that you have had as much school or college 
as you want, or would you like to go back to school or 
college after the war? (Check one) 
I have had as much school or college as I want 
I would like to go back to full-time school or college 
I would like to go back to part-time school or college 
No answer 


It has been planned that the federal government will 
pay tuition and other educational expenses plus a living 
allowance of $50 per month to veterans who qualify and 
want to continue their schooling. Do you think you 
would go back to full-time school or college after the 
war, if this is done? (Check one) 
I expect to go back to full-time school whether or not 
the government helps me 
I will go back to full-time school tf the government 
provides this help 
I will not go back to full-time school even if the gov- 
ernment would help me 
Undecided 
No answer 


44, Regardless of what you would like to do, do you think 


you will actually go back to school or college after the 
war? (Check one) 
Yes, to full-time school or college 
Yes, to part-time school or college 
No, I don’t think I will go back to school or college 
Undecided 
No answer 


analysis, giving the following ideal scale types: 


1. 


Would like to go to full-time school 

Thinks he will actually go 

Expects to go whether or not the government aids him 
Chooses school among alternatives 


4 


wW~Ieo oo 


100% 


12% 
22 


41 
21 
4 


100% 
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2. Would like to go to full-time school 
Thinks he will actually go 
Expects to go whether or not the government aids him 
but 


Does not choose school among alternatives 


3. Would like to go to full-time school 
Thinks he will actually go 
but 


Expects to go only if the government aids him 
Does not choose school among alternatives 


4, Would like to go to full-time school 
but 


Does not think he will actually go full-time 
Expects to go only if the government aids him 
Does not choose school among alternatives 


5. Indicates neither interest in going to full-time school nor plans to attend 


Several additional scale types relating to interest in part-time 
school were also obtained, but need not concern this discussion. 
One nonscale type which was not large but was important was noted : 
nearly 2 per cent of the men deviated from type 1 in that they said 
they expected to attend full-time only if the government provided 
some aid. These men were considered as belonging with type 1 for 
predictive purposes, since government aid was assured. 


Soldiers’ Educational Plans in the Summer of 1944 


Roughly 7 per cent of the survey sample of white enlisted men‘ 
fell into type 1 or the slightly deviant type as described above. 
These men were categorized as having definite plans for enrollment 
in full-time school. Another 4 per cent of the survey sample fell 
into types 2 and 3, which differed only in respect to the expectation 
of having government aid available. Since it was known that such 
aid would be available—the Serviceman’s Readjustment Act of 
1944 was passed as the survey went into the field—it was felt that 
types 2 and 3 should be combined. These men were thereafter 
categorized as having tentative plans to attend full-time school, 
even though they did not express plans to attend school when con- 
fronted with a choice of alternatives. 

Four fifths of those with definite plans were high school graduates, 
under twenty-five years of age, and unmarried. All but 3 per cent 

* Because white enlisted men comprised more than four fifths of the Army’s total 
strength and because many of the pretests were conducted exclusively within this seg- 


ment, the data to be presented will relate entirely to white enlisted men. Surveys were, 
however, conducted among officers and among Negro enlisted men. 
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had at least two of these characteristics. Roughly two thirds of 
the men with definite plans to return to school expected to attend 
college, while most of the remainder expressed interest in trade and 
business school courses. Less than 5 per cent were planning to 
return to academic high school work. 

The findings with respect to soldiers’ plans for postwar education 
were presented in detail in a separate report.° Since the interpre- 
tation of the data was crucial to their use for planning purposes, a 
full quotation of the interpretation offered in the final report is 
given below: 


The method used in classifying men’s educational plans is rigorous enough so 
that it may safely be said that very nearly all the eight per cent (of officers and 
enlisted men) classified as definitely planning full-time school really would go back 
to full-time school if the Army were demobilized today. Relatively few of the men 
with tentative plans for full-time school and the men who are now planning part- 
time school though they would like to go to full-time school can be expected to re- 
turn to full-time school, especially in view of the fact that many of them have fairly 
definitely formulated employment plans. Tentatively, then, the best prediction 
of the number of men who will return to full-time school is in the neighborhood of 
eight per cent. 

The Army is, however, being demobilized not today, but in the indefinite future. 
The question to which an answer is wanted is: How many men actually will return 
to school after demobilization? The figure of 8 per cent given above is the best 
answer which can be offered at the present time, but it is possible to point out fac- 
tors in the situation which may alter this percentage. 

Analysis of a study made among white enlisted men in the United States in 
October, 1944, indicates that their knowledge of the educational provisions of the 
“GI Bill of Rights” is still scanty and inexact. Less than a fifth of all the men 
know even three broad features of the provisions—that a man who was 24 years 
old when he entered the Army can get aid to return to school even though he had 
been out of school for several years before his induction; that the maximum amount 
provided for tuition and fees is $500; and that a single man is to receive a mainte- 
nance allowance of $50.00 a month—and only a third of the men definitely plan- 
ning to return to full-time school know this much about the GI Bill. It is quite 
probable that men serving overseas have even less information about it. 

Thus far, the GI Bill of Rights has not been an outstanding factor in the decision 
to return to school. In fact, only a quarter of the white enlisted men definitely 
planning to go to full-time school are depending on qualifying for benefits. (This, 
of course, does not mean that the remainder will not apply for benefits.) But 
among the men whose plans are not definite are many who could not go to school 
without aid and who are not yet fully informed about the existence of aid. Half 
the white enlisted men who are tentatively planning full-time school say that they 
need government aid in order to return, and all the men who would like to go full- 
time but are planning only part-time attendance think of the possibility of attend- 
ing full-time school only in terms of receiving government aid. It is only reason- 


* Originally issued as Report B—121 in October 1944 and reissued in March 1945 as 
Report B-133, “Post-War Educational Plans for Soldiers,’’ one of a series entitled Post- 
War Plans of the Soldier. 
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able to suppose that some of these men will take advantage of the opportunity the 
GI Bill offers, once they know about it. 

Economic conditions during and after the demobilization period will influence 
decisions about schooling, also. Widespread unemployment might lead men with 
no real interest in further education to return to school for the sake of government 
allowances, though the unemployment benefits provision should tend to minimize 
this. On the other hand, if good jobs at high pay are plentiful some men now 
thinking in terms of school may indefinitely postpone further education. 

The date at which demobilization occurs may also affect men’s decisions to re- 
turn to school, since most men are implicitly making some assumption about the 
length of the war in deciding whether or not to return to school. The length of 
time one has been out of school, one’s age and marital status, themselves inter- 
related variables, are highly related to going back to school If the war lasts be- 
yond their expectations, many men now planning to go to school may have taken 
on the responsibility of wives and children, or may come to consider themselves 
too old or too long out of school to attempt to resume. Offsetting this to some 
extent will be the continuing induction of eighteen year olds, which tends to main- 
tain the size of the younger age group from which most of the prospects for full- 
time school come.’ 


Later tabulations on the characteristics of the Army as of summer 
1944 revealed that the sample used to prepare the basic estimates of 
soldiers’ plans for school contained a slight surplus of men under 
twenty-five who were high school graduates or had started college 
before entering the service—men of the type most likely to plan to 
return to school. This bias led to an overstating of plans for full- 
time school. The original estimate for white enlisted men should 
have been closer to 6 than to 7 per cent. In subsequent studies, 
therefore, the estimates of school plans for the Army as a whole 
were derived by computing the proportions having such plans by 
age groups within the various educational levels. These rates 
were then applied to the known distribution of the Army by age 
and education. 


Plans of Separatees 


It was found that with increasing knowledge of the GI Bill of 
Rights, there was an increase in the proportion of soldiers planning 
to return to full-time school. In the summer of 1945, when surveys 
were conducted both among separatees and men remaining in service 
in the continental United States, it was estimated that about 8 per 
cent of all white enlisted men in service or in process of demobiliza- 
tion had definite plans to return to school. It was at this point 
that the quantitative evaluation of plans as a basis for predicting 
performance was undertaken. 


7 [bid., pp. 7-8. 
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The same basic questionnaire was used to survey plans of men 
remaining in service and plans of separatees. The questions on 
plans for school were those previously used, except for dropping the 
question on the effect of the availability of government aid, since 
this was no longer a contingency. 

It will be recalled that the first step in evaluating plans expressed 
while in service, as a basis for predicting performance after discharge, 
was to ascertain whether or not such plans would be affected by the 
fact of separation. Since, however, soldiers remaining on active 
duty could not conveniently be followed up through separation, 
the test of sustaining their plans unchanged must be a compari- 
son between plans of men remaining in service and plans of sepa- 
ratees, making allowance for certain differences in the characteris- 
tics of the two groups. It was assumed that if men of a given age 
and education expressed plans for school in the same proportion 
whether remaining in service or being discharged, then a follow-up 
of men surveyed at the time of discharge might be used to evaluate 
the predictive value of plans expressed by all soldiers surveyed. 

Among men remaining in service in the continental United States 
in August 1945, 13 per cent expressed definite plans to return to 
full-time school or college. Among separatees surveyed in July 
1945, at four widely scattered separation centers, 3 per cent ex- 
pressed definite plans to return to full-time school or college. But 
the separatees were predominantly older men who had been out of 
school for some time before entering the service. The men remain- 
ing in service in the continental United States, on the other hand, 
included a large proportion of younger men who had been inducted 
directly from school, and many of these men planned to return. 
For men of comparable age and previous education, there was no 
significant difference between the school plans of separatees and 
those of men remaining in service. 

By December 1945, approximately midway in demobilization, 
younger men without dependents were being discharged in higher 
proportion than was the case five months earlier. Among Decem- 
ber separatees surveyed at five northeastern separation centers, 
slightly over 6 per cent expressed definite plans to attend full-time 
school. And, parenthetically, it may be stated that a War Depart- 
ment survey of enlisted men discharged in July 1946 found nearly 
10 per cent expressing definite plans to attend full-time school. 

Thus, it appears that there was considerable stability in soldiers’ 
plans for full-time school, and that the fact of separation did not 
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markedly change men’s plans in this respect. Therefore, it was 
relevant to inquire into the extent to which the expression of such 
plans at the time of discharge turned out to be predictive of efforts 
to attend full-time school. 

Table 4 presents the classification of plans for full-time enroll- 
ment in school or college among the two sample groups of separatees 
who were followed up by mail to ascertain the extent to which they 
attempted to carry out their plans in the first two to four months 
after discharge. As earlier reported, data on their postseparation 
performance were secured from nearly 92 per cent of the July sepa- 
ratees and 88 per cent of the December separatees for whom legible 


TABLE 4 


Pians For Fuuiti-tTime ScHooL ENROLLMENT EXPRESSED aT TIME OF 
Discoarce sy Waitp Enuistep MEN 


(July and December 1945) 


(Per cent) 

July December 

Plans for full-time enrollment separatees separatees 
All separatees 100.0 100.0 
Definite plans to enroll 3.1 5.7 
Tentative plans to enroll 1.8 2.7 
Considering enrolling 3.3 4.9 
No expressed plans to enroll 91.8 86.7 
Number of cases 1,812 2,477 


addresses were available at the start of the follow-up. There was 
no educational bias in returns received from July separatees, but 
among December separatees there was a somewhat higher rate of 
return from high school graduates and college men than from those 
of lower educational attainment. In the data to be presented, all 
percentages are based on the total number of separatees actually 
contacted by the follow-up. Therefore, those percentages relating 
to plans of separatees may occasionally differ slightly from those 
reported for all separatees prior to the follow-up. 


Postseparation Performance 


Both for July and for December separatees, the number in school 
or college within three to four months after discharge was very close 
to the number who had expressed definite plans to attend school. 
But these were not all the men who could be expected to attend. 
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Three to four months was not a long enough interval to permit all 
men planning to enroll in school to carry out their plans. This was 
particularly true of the December dischargees, for whom an unfore- 
seen contingency had developed. The limiting factor on their car- 
rying out their plans was not the availability of government aid, 
but of space in the schools they wished to attend. 

Many men were unable to enroll in the colleges they had planned 
to attend, largely because so many veterans were choosing the 
“name” colleges. Others had to postpone their enrollment for 
personal reasons. In addition, there were, of course, a number of 
men who had planned to take extended vacations before returning 
to school, and for these men, likewise, the fact that they had not 
enrolled within three or four months after discharge could not be 
considered a failure of prediction. Therefore, for purposes of evalu- 
ating the extent to which plans for school were predictive of actual 
returns, the criterion of successful prediction could not be solely 
the fact of enrollment. 

Just as men could be ranked according to the plans for school 
which they expressed at the time of separation, they could also be 
ranked crudely according to the interest they had shown after dis- 
charge in enrolling or attempting to enroll in school. The following 
groups were distinguished in analyzing returns to the two follow-up 
surveys: 

1. Men who reported they were actually enrolled in school full- 
time. 

2. Men who reported that they definitely planned to enroll full- 
time and that they had already applied to the Veterans Adminis- 
tration for certificates of eligibility. 

3. Other men who reported that they definitely planned to enroll 
full-time (many of whom had applied to one or more schools or col- 
leges for admission). 

4. Men who said they might enroll full-time. 

5. All others, including those enrolled or planning to enroll part- 
time and those taking on-the-job training.® 

Table 5 shows for July and December separatees the proportion 
in school or interested in attending at the time of the follow-up sur- 
veys, two to four months after discharge. 


*It should be noted that veterans interested in education or training could receive 
aid for part-time schooling or for on-the-job training as well as for full-time schooling. 
In so far as veterans who had planned to attend full-time school shifted their interesta 
to a part-time or job training course, they are here treated as complete errors in pre- 
diction. 
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The proportion of separatees who should be classified as having 
made a serious effort to attend full-time school probably lies some- 
where between the number who were either already enrolled or who, 
planning definitely to enroll, had applied for aid, and the total num- 
ber who said they were definitely planning to enroll (in other words, 
between 4.9 and 7.4 per cent for July separatees and between 6.9 
and 10.0 per cent for December separatees).? Comments written 
in a number of the questionnaires of men who said they were plan- 
ning to enroll but had not yet applied for certificates of entitlement 
indicated that the men were making an all-out effort to arrange for 
enrollment with the school before applying to the Veterans Admin- 
istration for a certificate. 


TABLE 5 


PosTsEPARATION INTEREST IN FuLi-Time EpucaTion RgEportTEep 
BY Separatens SuRVEYED Two To Four Monrss 
AFTER DISCHARGE 


(Per cent) 
July December 
Postseparation interest in full-time education separatees separatees 
Total 100.0 100.0 
Actually in school 3.0 4.7 
Definitely planning to enroll 4.4 5.3 
Had applied for aid (1.9) (2.2) 
Had not applied for aid (2.5) (3.1) 
Considering enrolling* 8.7 1.8 
No expressed interest in full-time school 83.9 88.2 


* July and December figures based on different criteria. July figure probably includes a good many 
men actually interested in part-time school. 


The number of men with definite plans for school had been re- 
garded as the best estimate of the number who would actually at- 
tempt to return. Both follow-up surveys, however, indicate that 
the number who had returned or seemed likely to return as of three 
months after discharge was nearer to the total number who had 
expressed either definite or tentative plans for school. 

It is apparent that in so far as the prediction problem is defined 
as estimation of the proportion of separatees who would seek to 
enroll in full-time school within a year of discharge from the service, 
only two observations are available to evaluate the use of plans for 
making such a prediction. Further, there is little to choose between 


* This assumes that relatively few men had made an unsuccessful effort to enroll and 
had thereupon given up the idea of returning to school by the time of the follow-up. 
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the accuracy of the prediction for July separatees and that for De- 
cember separatees. All that can be said is that in two instances 
where plans of separatees were used as a basis for estimating school 
enrollment of groups of separatees, the estimates seem to have been 
reasonably accurate if based upon the proportions with either defi- 
nite or tentative plans to enroll. 

Even at the time of final preparation of this volume, there is not 
yet available any definitive estimate of school attendance by veter- 
ans. As of December 31, 1946, the Veterans Administration re- 
ported approximately 1,680,000 veterans enrolled in institutional 
education under either Public Law 346, the Servicemen’s Readjust- 
ment Act of 1944, or Public Law 16 providing educational benefits 
for disabled veterans. It may be assumed, from earlier sample 
surveys made by the Veterans Administration, that roughly seventy 
per cent of the veterans enrolled in schools and colleges are full-time 
students. Thus, one would obtain an estimate of 1,175,000 full- 
time students by the end of 1946, roughly six months after the com- 
pletion of demobilization of the great bulk of the wartime armed 
forces. The figure is not markedly different from that estimated 
more than a year previously on the basis of the surveys conducted 
in the summer of 1945—“‘It seems likely that within six months to 
one year after total demobilization not less than one million men 
will have enrolled for full-time study in educational institutions.” 1 
It would appear, then, that so far as concerns the objective of esti- 
mating the proportion of veterans who would seek to enroll in 
school, surveys of soldiers’ plans and intentions gave estimates rea- 
sonably accurate for general planning purposes under the conditions 
that actually prevailed during and subsequent to demobilization. 
It must be recognized that if a different set of circumstances had 
prevailed, the predictions might have been made less accurate. 
Had there been no increase in the amount of aid available to veter- 
ans in December 1945, the proportion enrolling in schools and col- 
leges might have been considerably less—perhaps no more than 
the proportion who had expressed definite plans to enroll in school. 
On the other hand, if there had been eight million unemployed, as 
predicted by a number of leading economists for the immediate 
postdemobilization period, it is quite likely that the number of vet- 
erans seeking to enter schools and colleges would have been far 
greater. In any case, the limited facilities of our college system 


10 Information and Education Division, Report No. B-174, “Soldiers’ Plans for Full- 
time School or College,” December 28, 1946. 
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have undoubtedly held down to some extent the volume of enroll- 
ment which might have been attained if “‘name’’ colleges had been 
able to accept all qualified applicants. 

But what of the accuracy of predictions made for the individual 
veteran on the basis of his plans? 


Prediction of Enrollment for the Individual Separatees 


Among respondents to the follow-up of July separatees, less than 
a third of those who had expressed definite plans to attend school 
had actually enrolled within the first few months after discharge 
and another fifth reported that they still planned to enroll. Men 
who had expressed only tentative plans to attend school apparently 


TABLE 6 


RELATIONSHIP OF PRESEPARATION PLANS FOR EpucaATION TO ENROLLMENT 
og IntEeREsT IN Foii-Tuam ScuHoot Reporrep Two ro Four 
MONTHS AFTER SEPARATION 
(July Separatees) 

(Per cent) 


PLANS FOR FULL-TIME ENROLLMENT AT TIME 
OF DISCHARGE 
POSTSEPARATION INTEREST 
IN FULL-TIME SCHOOL Definite Tentative Considering None 


plans plans enrolling expressed 

Total 100 100 100 100 

In school 31 32 16 1 
Definitely planning to 

enroll 21 26 9 3 

Had applied for aid (13) (10) (2) (1) 

Had not applied (8) (16) (7) (2) 

Considering enrolling 15 29 20 8 

No expressed interest 33 13 55 88 

Number of cases 2 81 56 1,609 


returned in about the same proportion as those who had been more 
definite. (The number of cases in each of these groups is so small 
as to make comparison exceedingly hazardous.) Further, about a 
fourth of the men who had indicated they were considering attend- 
ing full-time school, and about 4 per cent of those who had not, at 
the time of separation, expressed any plans to attend, had either 
enrolled full-time or had become definite prospects for full-time 
education when surveyed two to four months after their discharge 
from the service (Table 6). 
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At the time the July (1945) separatees were returned to civilian 
life, there was still a great demand for workers in war industry. A 
number of the men indicated that they planned to take such a job 
during the period of emergency and then expected to return to full- 
time school. The war ended only a month later. It is not surpris- 
ing that some of these men who had not predicted they would return 
to school immediately after discharge should actually have done so 
under the changed circumstances. 


TABLE 7 


RELATIONSHIP OF PRESEPARATION PLANS FOR EpucaTION TO ENROLLMENT 
og Inrerest 1n Foui-Time Scooot Reportep THREE To Four 
Monrss AFTER SEPARATION 
(December Separatees) 
(Per cent) 


PLANS FOR FULI-TIME ENROLLMENT AT TIME 
OF DISCHARGE 
POSTSEPARATION INTEREST 


IN FULL-TIME BCHOOL Definite Tentative Considering None 
plans plans enrolling expressed 

Total 100 100 100 100 

Tn school 46 16 8 1 
Definitely planning to 

enroll 32 36 21 2 

Had applied for aid (14) (18) (8) * 

Had not applied (18) (18) (15) (2) 

Considering enrolling 2 8 4 2 

No expressed interest 20 40 67 95 

Number of cases 126 62 103 1,808 


* Less than .5 per cent. 


The follow-up of December (1945) separatees showed a somewhat 
closer relationship between plans for education and postseparation 
performance. It will be seen (Table 7) that a clear gradation exists 
in the extent of returns to school by plans classification—definite, 
tentative, etc. Not only did a higher proportion of the men with 
definite plans tend to carry out those plans but a lower proportion 
of those who had noé planned to return to school actually did return. 

Tables 6 and 7 indicate that, although a certain proportion of men 
who had not expressed at time of discharge either definite or tenta- 
tive plans to enroll in school actually did enroll, in general one would 
predict enrollment only for those individuals who had expressed 
definite or tentative plans. The relationship between these cate- 
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gories of soldiers’ plans and postseparation performance is summa- 
rized in Table 8. The definition of performance was to be “an at- 
tempt to enroll in school.”’ Two criteria of performance are used in 
Table 8: Criterion A—men who reported themselves either in school 
or as planning to enroll full-time and as having applied for educa- 
tional benefits; Criterion B—same as A plus all other men who re- 
ported definite plans to enroll full-time when surveyed after separa- 
tion. In general, it is felt that Criterion B more nearly represented 
the group likely to enroll or make an effort to enroll during the first 
six months to a year after discharge. 


TABLE 8 


Puans ror Four-Tiwe Scooo, Revative to Two Crireria oF ENROLLMENT 


(July and December Separatees) 
(Per cent) 


POSTSEPARATION PERFORMANCE 
PLANS TO ATTEND Criterion A Criterion B 


FULL-TIME SCHOOL 
Total Enrollees* Others Total Enrollees** Others 


July Separatees 

Total 100.0 4.9 96.1 100.0 7.4 98.6 
Definite 3.1 1.4 1.7 3.1 1.6 1.5 
Tentative 1.8 0.8 1.0 1.8 1.1 0.7 
All others 95.1 2.7 92.4 95.1 4.7 90.4 
December Separatees 

Total 100.0 6.9 98.1 100.0 10.0 90.0 
Definite 5.9 3.6 2.3 5.9 4.6 1.3 
Tentative 3.0 1.0 2.0 3.0 1.5 1.5 
All others 91.1 2.3 88.8 91.1 3.9 87.2 


* Includes those actually enrolled and thoee planning to enroll full time who had applied for aid. 
*™ Includes those actually enrolled and all those planning to enroll full time. 


Table 8 clearly indicates that in two respects the plans expressed 
by December separatees were more highly predictive of postsepara- 
tion performance than were plans of July separatees: (1) among 
those who expressed plans to go to school, a higher proportion car- 
ried out or sustained such plans; (2) among those who actually en- 
rolled or seemed likely to enroll on the basis of follow-up data, a 
higher proportion consisted of individuals who had reported plans 
to return to school. 

It is of interest to inquire into the reasons for the greater degree 
of association between plans and performance of December sepa- 
ratees as compared with July separatees, and also to inquire in gen- 
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eral into the reasons why individual predictions based on soldiers’ 
plans to attend full-time school were not more often correct. In 
doing so, it is convenient to consider three aspects of the prediction 
problem : 

1. The adequacy of the classification of soldiers’ plans to attend 

full-time school. 

2. The characteristics of the members of the population surveyed. 

3. The situation encountered after discharge. 

Each of these aspects will be briefly discussed in the pages that 
follow. 

Adequacy of plans classification. Evaluation of the adequacy of 
the classification of soldiers’ plans used as a basis for the prediction 
of enrollment for full-time study in school or college involves a re- 
consideration of the structuring of the prediction problem, an 
examination of the predictive value of specific items used in the 
questionnaire and of the method whereby such items were com- 
bined and, finally, consideration of alternative approaches. 

It will be recalled that in the major surveys of soldiers’ plans, 
classification of plans for education was based upon responses to 
three items which had been used to define major scale types in ear- 
lier pretests and surveys. The selection of only three items as a 
basis for classification sacrificed one of the chief advantages of the 
use of a scale—that of permitting ready classification of respondents 
who failed to answer one or two questions in a battery or answered 
one or two questions in such a way as to suggest misinterpretation 
or erroneous checking. On the other hand, the use of a long battery 
of questions on educational plans in a questionnaire relating to a 
large number of fields which veterans might wish to enter was not 
deemed feasible. Indeed, a long battery of questions which more 
or less repeated the same inquiry might be expected to produce 
more erroneous checks and failures to answer. And since the items 
selected were items which produced the scale pattern with relatively 
low degree of error, categorization based in part on consistency of 
response (rather than being based entirely on scaling to type) did 
not markedly reduce the size of the groups classified as having defi- 
nite or tentative plans to attend full-time school. At the same time, 
the experience gained with the longer set of items used in the earlier 
survey of educational plans and in pretests suggested those combina- 
tions of responses that should be made in attempting to arrive at a 
small number of categories for the classification of those men some- 
what interested in further education but not really planning to be- 
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come full-time students after discharge. How important were 
these categories in their contribution of veteran students? 

Tables 6 and 7 indicated that the proportion of veterans who re- 
turned to full-time school or were considering returning at the time 
of the follow-up, from among all separatees who had not expressed 
any intentions of becoming full-time students, was small—4 per 
cent among July separatees and 3 per cent among December sepa- 
ratees. But the contribution to total enrollment of veterans made 
by separatees who had not planned to return to school was never- 


TABLE 9 


DisTRisvuTion oF ToraL SAMPLE AND OF VETERANS ENROLLED OR PLANNING 
To ENROLL IN FuLt- of Part-Time ScHoon, sy EpucaTioNaL PLans 
Expressep aT DiscHARGE 


(December 1945 Separatees) 


(Per cent) 
TOTAL PART-TIMB 
SAMPLE FULL-TIME STUDENTS STUDENTS 
PLANS EXPRESSED AT TIME In school 
OF DISCHARGE In Planning or planning 
school to enroll to enroll 
Total 100 100 100 100 
Full-time school 14 76 75 19 
Definite plans (6) (58) (35) (6) 
Tentative plans (3) (10) (20) (5) 
Considering enrolling 
full-time (5) (8) (20) (8) 
Plan to enrol! part-time 11 6 12 46 
Vague interest in education 
but no plans indicated 26 9 7 19 
No expressed interest 49 9 6 16 
Number of cases 2,098 99 112 118 


theless considerable because of the magnitude of this segment of 
the population relative to the number who had planned to enroll. 
Table 9 shows in detail the contributions made to the total full-time 
and part-time enrollment or prospective enrollment of December 
separatees by the various categories of educational plans expressed 
at the time of discharge. It is apparent that in the case of both full- 
time and part-time enrollment relatively few students, or potential 
students, were drawn from that half of the sample which had ex- 
pressed no interest in education at the time of discharge. Of full- 
time enrollees, in fact, three fourths were contributed by the 14 per 
cent of the sample who had indicated that they were either planning 
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to attend school full-time or at least were considering full-time 
enrollment at the time of discharge.!! Nevertheless, it is pertinent 
to inquire into the general plans classification of those men who 
were categorized as having plans for part-time school or as having 
merely vague interest, or even no expressed interest, in school yet 
who reported themselves in school or planning to enroll some three 
or four months after discharge. Were these men undecided as to 
their plans at the time of discharge, or did they have other plans 
which were modified by the experiences of the first few weeks of 
civilian life? The latter seems very largely to have been the case. 
In answer to questions involving alternative plans, these men had 
indicated in large proportion that they planned to work as employ- 
ees. Some had expected to return to work for the employer they 
had been with before the war; a larger number had expected to find 
anew job. Unfortunately, the number of cases involved is too few 
to permit any adequate analysis of the reasons for the shift which 
occurred in the plans of these men, or of their characteristics. 

What estimates as to probable enrollment of veterans as full-time 
students would have been made if responses to any one of the indi- 
vidual questions relating to plans for school had been taken as the 
basis for plans classification? 

Let us consider first the two questions which served as the basis 
for establishing the cutting points which defined the group described 
as having definite plans and that described as having tentative plans 
—the question involving choice of full-time school among several 
alternatives (Q 22) and the direct question relating solely to plans 
for school attendance (Q 39). It is to be noted that responses to 
the separate items will give different estimates from those derived 
from the use of the questions in a scale in so far as the items show 
other than a random distribution of scale error. Since the three 
questions used for a basic classification of separatees’ school plans 
were too few to permit scoring nonperfect types in accordance with 
scale theory, complete consistency of response had to be the crite- 
rion of classification as having definite plans. It will be apparent, 
therefore, that the question involving choice of alternatives, used 
to define the type called definite planners, actually yielded a higher 
estimate of school enrollment when used alone than when used in 

Tt is noteworthy that although nearly half of the part-time enrollees were drawn 
from the 11 per cent of the December sample who expressed plans for part-time school, 
more than a third of the part-time enrollees come from the three fourths of the sample 


who expressed little or no interest in education at the time of separation. The problem 
of predicting part-time education will be discussed briefly later in this section. 
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conjunction with other questions (since those not consistent in their 
expressed plans would be eliminated). The estimates obtained 
from this source, and their relationship to performance, are shown 
in Table 10.12 From a comparison of Table 10 with Table 8 (using 
Criterion B), it will be noted that a prediction based upon question 
22 would be higher than that yielded by the classification ‘‘definite 
plans’ but slightly lower than the combined ‘‘definite’”’ plus “‘ten- 


TABLE 10 
RELATIONSHIP OF RESPONSES TO INITIAL PRESENTATION OF ALTERNATIVES 
iN Survey oF PLans AND PosTsEPaRATION ScHooL ENROLLMENT 
Criterion B—July and December Separatees) 
(Per cent) 


Question 22: Regardless of whether you have definite plans or not, which one of 
these things do you think you will most probably do FIRST—right 
after you leave the Army? 


STATUS AT TIME OF FOLLOW-UP 


July separatees December separatees 
RESPONSE In school In school 
CATEGORY or planning All or planning All 
CHECKED Total to enroll others Total to enroll others 
Total 100.0 7.4 92.6 100.0 10.0 90.0 

Work for an 

employer* 85.4 5.3 80.1 82.2 4.4 77.8 
Go to full-time 

school or college 4.7 2.1 2.6 8.1 5.3 2.8 
Run own business 

or farm 8.4 _ 8.4 7.6 0.2 7.4 
No answer 1.5 _— 1.5 2.1 0.1 2.0 


* Includes those who planned to work in family businesses or on family farms. For exact wording of re- 
sponse categories, see Appendix A to Chapter 16. 


tative’ plans categories. Accepting Criterion B as the best esti- 
mate of actual enrollment after discharge, it would appear that the 
higher prediction given by the combined ‘‘definite” and ‘‘tentative”’ 
categories using three questions would be a closer approach to the 
mark than the prediction based on question 22 alone. It is to be 
noted, moreover, that when the ‘definite’ and ‘“‘tentative” cate- 
gories of plans in Table 8 are combined, not only is a higher predic- 
tion of attendance given but a higher proportion of the group desig- 


12 The question wording used in the surveys of separatees will be seen to differ some- 
what from that used in the earlier surveys, but the fundamental relationships of the 
questions themselves were unchanged. The question numbers reported are from the 
survey of separatees and will serve as guidance to the reader who wishes to consult 
the questionnaire itself, contained in Appendix A to Chapter 16. 
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nated as likely to enroll actually does so. Thus, of the 4.7 per cent 
of July separatees who chose the alternative ‘‘go to full-time school 
or college” in answer to question 22, 2.1 or less than half (45 per 
cent) were in school or planning to enroll as of three to four months 
after discharge, but of the 4.9 per cent classified as having definite 
or tentative plans by the modified scale, 2.7 or slightly over half (55 
per cent) were in school or seemed likely to enroll. Again, among 
December separatees the choice of school in question 22 by 8.1 per 
cent of the group was a valid prediction of attendance for about 65 
per cent of those who made the choice; but of the 8.9 per cent 
classified as having definite or tentative plans, 69 per cent actually 
qualified as students by the same criteria." 

As a basis for prediction of individual enrollment, then, the ques- 
tion involving a choice of alternatives would seem to be less satis- 
factory than its use in combination with the other scale items used. 

Reference to the role of the direct check-list question on plans for 
school (Q 39) in the classification of plans indicates that it gave the 
cutting point between those men classified as having definite or 
tentative plans to attend school and all others. Table 11 indicates 
that the use of this question alone differs hardly at all from the 
use of the three items selected to define scale types if the types 
are dichotomized into “definite or tentative planners” versus ‘‘all 
others.” It should not be inferred, however, that the single 
direct question provides an adequate basis for classifying soldiers’ 
plans for school. Reference to Table 7 clearly indicates that De- 
cember separatees classified as having definite plans to attend school 
actually enrolled in considerably higher proportion than did those 
with only tentative plans. If conditions had been less favorable 
to school enrollment of veterans, it might well have been that few 
except those classified as having definite plans actually would have 
enrolled. 

It had been hoped that by inserting at the close of the question- 
naire a second opportunity to indicate choice among alternatives it 
might be possible to get a more reasoned choice and hence a better 
basis for prediction than would be provided by the earlier question 
involving alternatives (Q 22). Such did not prove to be the case, 


14 The differences are not statistically significant if the high degree of correlation be- 
tween membership in the two categories is ignored, but that for July separatees exceeds 
the 2¢ level if one assumes a .5 correlation. 

\ The source of the small difference that does occur is the fact that a few of the men 
who said they planned to go to full-time school in answer to question 39 had not indi- 
cated that they wanted to go to full-time school in answer to question 38. 
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however. For each of the samples of separatees, the number of 
men checking that they planned to attend school was slightly less 
for the second question involving alternatives than for the first, but 
the predictive value of the choices was not appreciably greater, as 
will be seen by comparison of the data given in Table 12 with the 
previously reported data for question 22 (Table 10). Table 12 
also gives some detail on the plans of men who were at the time of 


TABLE 11 
RELATIONSHIP OF RESPONSES TO DrikECT QUESTION ON ScHoo, PLANS 
AND PosTSEPARATION ScHOOL ENROLLMENT 
(July and December Separatees) 
(Per cent) 


Question 39: Regardless of what you would like to do, do you think you will actually 
go back to schoo! or college after you leave the Army? 


STATUS AT TIME OF FOLLOW-UP 


July separatees December separatees 
RESPONSE In school In school 
CATEGORY or planning All or planning All 
CHECKED Total to enroll others Total to enroll others 
Total 100.0 7.4 92.6 100.0 10.0 90.0 

Yes, to full-time 

school or 

college 5.4 2.8 2.8 9.1 6.2 2.9 
Yes, to part-time 

school or 

college 14.5 2.0 12.5 13.6 1.4 12.2 
No, I don’t think 

I will go back 

to school or 

college 52.4 1.0 51.4 47.5 0.6 46.9 
Undecided 20.5 1.4 19.1 22.9 1.6 21.3 
No answer 7.2 0.2 7.0 6.9 0.2 6.7 


the follow-up either enrolled in school or planning to enroll although 
they had not at the time of separation expressed plans to enroll. 

By following the final choice of alternatives with a question ask- 
ing for degree of certainty of plans, it was possible to distinguish 
between those men who felt quite sure they would enroll in school 
and those who, while leaning toward school enrollment, were some- 
what less sure.'* Table 13 shows for December separatees the ex- 
tent to which feeling of certainty was related to the carrying out of 
plans. 


4 See question 69 in the questionnaire for separatees appended to Chapter 16. 
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TABLE 12 


RELATIONSHIP OF RESPONSES TO FINAL PRESENTATION OF ALTERNATIVES 
tn Survey oF PLANs AND POSTSEPARATION ScHOOL ENROLLMENT 


(July and December Separatees) 
(Per cent) 


Quxstion 68: In this questionnaire we have asked you a lot of questions about your 
plans for after your discharge. Now that you have thought them over 
a little more, try to take everything into consideration and check the 
one thing you will most probably do first of a—after you have gotten 
your discharge and taken a vacation. 


STATUS AT TIME OF FOLLOW-UP 


July separatees December separatees 
RESPONSE Tn school In school 
CATEGORY or planning All or planning All 
CHECKED* Total to enroll others Total to enroll others 
Total 100.0 7.4 92.6 100.0 10.0 90.0 

Work for pre- 

service 

employer 25.6 0.3 25.3 46.8 1.6 45.2 
Work for some 

other employer 48.7 3.9 44.8 24.7 2.0 22.7 
Go to full-time 

school 4.7 2.4 2.3 7.4 4.9 2.5 
Go into business 

for self 5.4 0.1 5.3 5.6 0.2 5.4 
Farm 5.2 0.1 5.1 3.3 0.2 3.1 
Undecided 6.0 0.4 5.6 6.5 0.7 5.8 
No answer 4.4 0.2 4.2 5.7 0.4 5.3 


* For exact wording of response categories, see Appendix A to Chapter 16. 


TABLE 13 


Crrralnty or PLans Expressep sy CHOICE AMONG ALTHSRENATIVES (QUESTION 68) 
AND POSTSEPARATION ENROLLMENT IN FuLi-TIMBp ScHOOL 


(December Separatees) 
(Per cent) 


STATUS AT TIME OF FOLLOW-UP 


PLANS CLASSIFICATION In school 

or planning AU 
Total to enroll others 

Total 100.0 10.0 90.0 
Full-time school 74 4.9 2.5 
Sure of plans (4.5) (3.7) (0.8) 
Less sure (2.9) (1.2) (1.7) 
Alternative plans 92.6 5.1 87.5 
Sure of plans (43.6) (2.1) (41.5) 


Less sure (49.0) (3.0) (46.0) 
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Of men who had indicated that they felt sure of their plans to 
enroll in school after discharge, 84 per cent actually enrolled or 
seemed likely to enroll; of those who felt less sure, only 42 per cent 
had enrolled or sustained their plans to enroll. Here, then, was a 
method of further subdividing the group classified by the other 
items used as having definite plans, and a method which did give 
considerable discrimination judged by actual performance. In so 
far as interest is focused upon accurate individual prediction, the 
combination of choice of alternatives and degree of certainty of 
choice definitely yields a gain. It will be apparent, however, that 
with conditions favorable to school enrollment by veterans, the 
choice of alternatives alone seriously understates the prospective 
returns to school by classifying under other rubrics those men seri- 
ously considering school enrollment but leaning somewhat more 
strongly to some other plan at the time of discharge. Further dis- 
cussion of the general structuring of the prediction problem and the 
consideration of alternative approaches will be deferred until after 
a brief analysis of the relationship of personal characteristics and 
degree of carrying out plans for school attendance. 

Population characteristics and prediction of enrollment. The great 
majority of soldiers who expressed plans to attend school after dis- 
charge were young single men who had been out of school a relatively 
short time when they were inducted into the service. Even among 
men discharged from the Army in December 1945, although single 
men under twenty-five years of age and out of high school a year or 
less before induction numbered only 10 per cent, they contributed 
56 per cent of the group with definite plans to attend school and 
31 per cent of those with tentative plans. 

A question much discussed in attempting to arrive at estimates 
of veteran enrollment was whether older married men and those 
longer out of school at the time of discharge were at all likely to 
return to school, even if they did have plans to go. It might be 
expected that those lacking the usual characteristics of a school 
population would run into more problems and difficulties in attempt- 
ing to become students than would the young single group differing 
only by a few years of age from the normal college population. 
Such reasoning apparently resulted in limiting educational benefits 
of the original Servicemen’s Readjustment Act of 1944 to veterans 
under twenty-five and those whose education was interrupted by 
the war. On the other hand, it seemed possible that the older or 
married men and those longer out of school might well be more 
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strongly motivated toward further education if they formulated 
plans for enrollment in the face of the difficulties to be anticipated. 

Examination of the results of the follow-up of December sepa- 
ratees indicates that, among the older or married men and those out 
of school more than a year before they entered the service, the pro- 
portion carrying out or sustaining their plans in the first few months 
after discharge was less than that of the younger single group more 
recently from school when inducted. It will be seen in Table 14 
that for each category of school plans, the group having the char- 
acteristics usually associated with student status was more likely 


TABLE 14 


PosTsEPARATION Returns To ScHoo. BY PLans Exrressrep at SEPARATION 
aMoNG SINGLE MEN UNDER AGE TWENTY-FIVE ouT oF SCHOOL 
4 YraR orn Luss, AND AMONG OTHER SEPARATEES 
(December 1945 Separatees) 
(Per cent) 


SINGLE MEN UNDER 25, OUT 


STATUS THREE OF SCHOOL ONE YEAR OR LESS ALL OTHERS 
TO FOUR 
MONTHS AFTER Definite Tentative Definite Tentative 
DISCHARGE Total plans plans Other Total plans plans Other 
Total 100 100 100 100 100 100 100 100 
In school or college 25 «64 21 8 2 35 14 1 
Planning to enroll 20 8629 32 13 4 36 35 2 
Had applied for aid (9) (16) (16) (3) (2) (18) (19) (1) 
Had not applied (11) (18) (16) (10) (2) (28) (16) (1) 
Not planning to enroll 55 17 47 79 94 29 51 97 
Number of cases 211 70 19 188 | 1,887 56 481,789 


Percentage constituted 
among all veterans 
in school or planning 
to enroll 45 28 5 12 65 18 10 27 


to return to school. Nevertheless, because they comprised so large 
a proportion of the total sample, the group less likely to return did 
contribute a majority of the students from among December sepa- 
ratees. (The data are contained in the bottom line of Table 14.) 
One reason for the higher predictive value of plans expressed by 
the December separatees as compared with July separatees seems 
to have been the differential proportion of men having the charac- 
teristics usually associated with school performance. Among the 
July separatees only about 5 per cent were single, under twenty-five, 
and out of school less than a year before induction, and of these, a 


STUDIES OF POSTWAR PLANS 619 


considerable number had entered the Army during peacetime so 
that their education had not in any sense been interrupted by the 
war. 

In summary, it appears that plans for postseparation school en- 
rollment were more highly predictive of enrollment of veterans 
having characteristics most similar to those of a school population 
than of enrollment of older veterans, married veterans, or those who 
had left school more than a year before they entered the service. 
On the other hand, among the younger, unmarried veterans out of 
school only a short time before they entered the service, those who 
had not planned to return to school after discharge were more likely 
to shift in the direction of enrolling, so that the overall accuracy of 
prediction from plans alone was not appreciably different from that 
obtaining for the older men, those married and those longer out of 
school. It would appear that those having the characteristics most 
similar to a school population were, upon discharge, most likely to 
encounter a situation which they defined as favorable to enrollment. 

The situation encountered after discharge. It was earlier observed 
that implicit in the plans expressed by soldiers and separatees was 
an evaluation of various contingencies relating to the situation to 
be encountered after discharge. Plans expressed in the summer of 
1944 on the basis of an assessment of future contingencies as they 
appeared at that date might readily be shifted as a result of a con- 
tinuing reassessment of those contingencies. Specifically, the man 
whose plans for school in the summer of 1944 rested on the convic- 
tion that he would be demobilized within a year and could start col- 
lege in the fall of 1945 might well decide to go to work upon learning 
a year later that he would be in service well into 1946. This would 
be particularly true if in that intervening year he acquired a wife 
or & prospective wife. 

The surveys of separatees indicated that there had not been any 
major change from July 1944 to December 1945 in the proportion 
of soldiers planning to enroll in school, although there had been a 
slight increase. Nevertheless it must be assumed that a number of 
soldiers decided not to attend and a slightly larger number decided 
to attend school as a result of changed evaluations of the situation 
to be encountered after discharge. Since the follow-up studies were 
conducted only among separatees, they do not afford adequate data 
on the magnitude of such changes. Among men discharged in 
July 1945, however, there is a clear indication that within two 
months after discharge a considerable number of veterans reevalu- 
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ated the opportunities available to them and shifted their plans 
accordingly. When they first learned that they would be among 
the first men demobilized, many veterans discharged at this time 
resolved to work in war industry, in order to get a chance at some 
of the high wages they had heard so much about while in service, 
and others decided to seek a good permanent job while the labor 
market still made great demands for workers. With the surrender 
of Japan, however, a number of these men decided to go to school 
instead. It is quite possible that such men would have expressed 
plans for school if surveyed a year earlier, and that the termination 
of the war thus resulted in a return to plans previously modified by 
prospects of war industry employment. The fact remains that the 
changed situation seems to have resulted in a more serious underesti- 
mate of school enrollment of July separatees than was true of De- 
cember separatees. 

It has already been mentioned that the increase in allowances to 
veterans attending schools, provided by amendment in December 
1945 to the earlier Readjustment Act, probably increased slightly 
the number of veterans actually enrolling in schools and colleges, 
and the fact that many veterans had inflated notions of the sort of 
jobs available to them may also, after deflation of such notions, 
have resulted in some increase in enrollment. Countering this, 
crowding in the colleges, which absorbed about two thirds of the 
full-time enrollees, has probably limited somewhat the proportion 
of later separatees who were able to carry out their plans to enroll 
in school. 

In concluding this review of the attempt to predict the enrollment 
of veterans in schools and colleges from a knowledge of plans ex- 
pressed while in service, brief consideration is given to the relation- 
ship between plans to enroll in school on a part-time basis, and 
actual part-time enrollment within a few months after discharge. 
Slightly over 11 per cent of the December separatees had reported 
that they would like to attend part-time school and that they 
thought they actually would attend after discharge. As has already 
been indicated (Table 9), the men planning to attend part-time 
school comprised nearly half of those who were enrolled or expected 
to enroll at the time of the follow-up. But, as will be seen in Table 
15, less than a fourth of the men who had expressed plans for part- 
time school had actually carried out or sustained their plans. Fur- 
ther, those who had not carried out plans for part-time school were 
not counterbalanced by any substantial enrollment by men who had 
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not expressed plans to enroll. This situation had early been fore- 
seen in reporting soldiers’ plans. The fact that plans for part-time 
school might be compatible with almost any other activity meant 
that a considerable number of men might report their own wishful 
thinking. There was nothing in the questionnaire to force the sol- 
dier to evaluate his own plans for part-time school. 

Table 15 also indicates that of those veterans who did not sustain 
their plans for part-time school, an appreciable proportion shifted 
either to full-time school or to on-the-job training. Thus, it is to 


TABLE 15 


RELATIONSHIP BETWEEN VaRnious Catzcoriss oF Soupiers’ EpucaTionaL 
PLANS AND EpucaTIONaL oR JoB-TRAINING Stratus THREE To Four 
Monrss arrer DiscHaRGE 
(December 1945 Separatees) 

(Per cent) 


PLANS FOR EDUCATION EXPRESSED AT DISCHARGE 


Vague 
EDUCATIONAL STATUS Full-time Part-time interest, ' No 
AT TIME OF FOLLOW-uP Total achool* school no plans** = interest 

Total 100.0 18.8 11.4 25.6 49.8 

Full-time school 10.0 7.6 0.9 0.8 0.7 
In school (4.7) (3.6) (0.3) (0.4) (0.4) 
Planning to enroll (5.3) (4.0) (0.8) (0.4) (0.3) 
Part-time school 5.5 Ll 2.8 1.0 0.8 
In school (1.2) (0.5) (0.5) (0.1) (0.1) 
Planning to enroll (4.3) (0.6) (2.1) (0.9) (0.7) 

On-the-job training 11.6 1.2 1.9 3.3 5.2 
In training (4.6) (0.4) (0.8) (1.0) (2.4) 
Planning to enter (7.0) (0.8) (1.1) (2.3) (2.8) 


All others 72.9 3.9 6.0 20.4 42.6 


* Includes those with definite and tentative plans plus those considering enrolling full-time. 

* Includes those who said they would like to enroll but did not plan to do so. 
be noted that either an intensified interest in education, leading to 
full-time enrollment, or a reevaluation of training opportunities, 
leading to a choice of on-the-job training, would result in wrong 
prediction just as would a decline in interest leading to a discarding 
of all plans for education. 

To soldiers interested in further education there was available aid 
for full-time school, for part-time school, or for on-the-job training. 
Full-time school was most often chosen by young single men, par- 
ticularly by those who had been in school just before they entered 
the service. Part-time school appealed more often to the married 
men who felt they could not afford to return to school full-time, 
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and to single men who had been working before entering the service. 
Relatively few soldiers knew about the possibilities of on-the-job 
training until well along in the demobilization period.'® While the 
types of training for which aid was available are in some respects 
quite dissimilar, they did apparently serve in some instances as 
adequate substitutions, one for another, so that no real change of 
plans would be involved. This would be particularly true in the 
case of on-the-job training as a substitute for business or trade 
school, and part-time college as a substitute for full-time enrollment. 

One further note is relevant : the 1944 cross-sectional survey found 
that of soldiers definitely planning to enroll in full-time school, two 
thirds planned to enroll in college, about a fourth planned to enroll 
in trade and business school courses and less than 5 per cent planned 
to complete standard high school courses. Veterans Administra- 
tion operating statistics as of summer 1946 showed veterans enrolled 
in educational institutions to have almost identical distribution by 
type of school with that predicted on the basis of soldiers’ plans. 

16 As of July 30, 1945, Veterans Administration statistics showed 20,229 veterans in 
institutional education and 2,120 in on-the-job training. By July 30, 1946, the num- 


ber reported in institutional education had increased thirtyfold (to 652,762), but the 
number reported in on-the-job training had increased two hundredfold (to 403,199). 


CHAPTER 16 


STUDIES OF THE POSTWAR PLANS 
OF SOLDIERS—CONTINUED' 


SECTION I 


PLANS TO RETURN TO PREVIOUS EMPLOYER 


PPROXIMATELY four fifths of the Army’s male personnel had 
worked for an employer just before entering the service. The 
Selective Service Act of 1940 assured reemployinent rights to all 
personnel entering the Armed Forces after May 1, 1940, who were 
occupying nontemporary jobs in the civilian economy as of the time 
of induction or enlistment. Some men, however, had left their reg- 
ular jobs either to take jobs in war industry or to take vacations be- 
fore entering the service. Once the war was under way, many men 
entering war industry were replacing others who had enlisted or 
been inducted. 

There was no accurate knowledge of the number of men in service 
who had reemployment rights to their preservice jobs. Estimates 
generally ran between 20 per cent and 30 per cent and were based 
largely on guesswork, as the following quotation suggests: 

“When allowance is made for those veterans, now probably 40 
per cent of the total, who entered military and naval service without 
previously having had gainful employment, for the large number 
who will have no reemployment rights because they occupied tem- 
porary positions or were unemployed at the time of induction, for 
those whose jobs will be wiped out by business failures or techno- 
logical changes, for those so disabled that they will not be able to 
resume their old occupations, and for those jolted out of a customary 
way of life, whom new maturity, skills, or ambitions will tempt to 
greener pastures, the number of veterans with effective rights to 
reinstatement is estimated at about 20 per cent of the total.’ ? 

‘ By John A. Clausen. 


? Walter J. Couper, “The Reemployment Rights of Veterans,” Annals of the American 
Academy of Political and Social Science, Vol. 238 (March 1945), p. 112. 
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The estimate that 40 per cent of all military personnel had not 
previously been gainfully employed was at least double the actual 
figure, but there was no way of knowing how many veterans had 
occupied temporary jobs or how many desired to find new jobs. 
Moreover, many employers indicated that they would attempt to 
reemploy not merely those veterans who had legal rights to a job, 
but all veterans who had been in their employ at the time of induc- 
tion or enlistment. It seemed relevant, therefore, to inquire into 
the number of men in service who planned to return to work for 
their previous employers. 


Ascertaining Plans to Return to Previous Employer 


The problem of multidimensionality of plans was nowhere more 
difficult to deal with than in the area of plans to return to work for 
one’s previous employer. Some men wanted to return to the exact 
niche they had previously occupied ; others wanted to return to the 
same employer but in a different (and usually a better-paying) job; 
still others were interested in working in the same occupation but 
with a different employer, yet were considering the possibility of 
going back to work for their former employer if other opportunities 
were not readily available. For a large proportion of the Army’s 
personnel who had previously been gainfully employed, it appeared 
that the possibility of returning to work, at least temporarily, for 
one’s previous employer was not to be completely dismissed even 
though other pastures might seem greener. A survey conducted in 
December 1943 to elicit attitudes toward postwar vocational guid- 
ance included three questions relating to interest in doing the same 
or a different kind of work with the same or a different employer 
after the war. More specifically, the questions asked were: 


1. Whether the respondent would like to go back to his old job with his 
old employer, a different job with his old employer, the same kind of 
work with a different employer, or a different kind of work with a 
different employer. 

2. What kind of work the respondent would like to do after the war. 

3. What kind of work the respondent actually expected to do after the 
war. 


It was found that of men who had previously been employed, a 
slight majority (54 per cent) said they would like to do the same 
kind of work, and of this group, about three fourths said they would 
like to work for their previous employer. Among the 46 per cent 
of all previous employees who said they would like to do a different 
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kind of work, however, nearly half were doubtful as to whether they 
would actually do a different kind of work, and a fourth thought 
they would probably return to work for their previous employer. 
Clearly this was not a unidimensional problem. 

Considerable difficulty was encountered in attempting to phrase 
questions simply enough to be intelligible to the soldier with only 
fifth or sixth grade education yet precisely enough to specify the 
dimensions desired for classification. For example, the expression 
“go back to your old job after the war,” was interpreted by a ma- 
jority of soldiers as referring to a job with their last employer prior 
to induction, but some men took this expression to mean the same 
general occupation, regardless of employer, and some took it to mean 
precisely the same occupation with the same employer. The man 
who hoped to go back to a somewhat different job with his previous 
employer was frequently at a loss to know how he should classify 
himself. To get around this problem, it was decided to ask sepa- 
rate questions about type of work or occupation, and about prospec- 
tive employer. 

Another major difficulty was encountered in inconsistencies 
among responses to questions which were designed primarily for 
men who had been employed just before their induction. Such 
questions usually carried an answer category, “I was not working 
just before I entered the Army,” but some men who had not previ- 
ously been employed failed to check this category and instead 
checked that they planned to do a different kind of work than they 
had previously done. Other men, not employed just before induc- 
tion but employed at some earlier date, further confused the picture 
with respect to probable reemployment of those drafted or volun- 
teering for the service from their jobs subsequent to the spring of 
1940. To meet this problem it seemed desirable first of all to ascer- 
tain employment status prior to induction and then in this area to 
study the plans only of those men who had occupied full-time jobs 
prior to their induction. 

In the world-wide survey conducted in the summer of 1944, men 
were classified first as to whether or not they had been employed 
just prior to induction or enlistment, then by their judgment of 
whether they could get work with their previous employer and, 
finally, whether they expected actually to return. They were also 


* The questions asked on these topics were as follows: 
After the war, do you think you could get work with the company or person you 
worked for before you came into the Army, if you wanted to? (Check one) 
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asked to describe the kind of work they expected to do, and this was 
coded as “similar to’’ or “different from’’ the kind they had previ- 
ously done. These data were used as a basis for estimating the 
minimum number of soldiers counting on going back to their old 
jobs and the maximum number considering returning to work for 
their previous employers. The method of classification is described 
in the following quotation from the survey report: 

“‘Those enlisted men who say both that they plan to do the same 
kind of work and that they expect to return to their old jobs may be 
considered a minimum estimate of the proportion definitely count- 
ing on going back to their old jobs. These comprise a fourth of the 
white enlisted men who worked as employees before they entered 
the Army. Those men (fourteen per cent of foriner employees) 
who are planning to return to the same kind of work and who say 
they may return to their former employers also seem more likely to 
try to get their old jobs back than to make a change. The estimate 
of the maximum number of men who are considering returning to 
their former employers includes all other men (twenty-one per cent) 
who say that they may go back to work for the employer they 
worked for just before they entered the Army. Some of these men 
are undecided about their postwar plans, and many of them are seri- 
ously considering alternative plans—a different type of work for an 
employer, full-time school, self-employment, etc.—but they do not 
feel sure enough about the alternatives to rule out going back.” 4 

It appeared likely then, on the basis of their plans, that about 40 
per cent of the soldiers who had previously been employed might 
return to work for their former employers. There was some doubt, 
however, as to the desirability of using the dimension “type of work”’ 


——— Yes, I’m almost sure I could 
——— Yes, I think so, but I’m not sure 
No, I probably couldn’t 
—_—— I’m sure I could not 
I worked for myself before I came into the Army 
I was not working before I came into the Army 
Do you think you actually will go back to work for the same employer (company, 
person, etc.) you worked for before you came into the Army? (Check one, 
Yes, I’m quite sure I will 
——_—~ I may, but I’m not sure 
No, I don’t think I will 


I worked for myself before I came into the Army 
I was not working before I came into the Army 

‘Information and Education Division Report No. B-129, Post-War Occupational 
Plans of Soldiers, March 1, 1946, p. 18. 
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as a criterion for estimating probable extent of return to previous 
employer. A man might desire a change in occupational category 
yet primarily be intent on securing a job with his old employer. 
Subsequently, therefore, the use of this dimension was discontinued 
and a different criterion for distinguishing between probable re- 
turnees and others was established. 


Plans of Separatees in July and December 1945 


The questionnaire administered to separatees used the same basic 
questions for a classification of plans to return to preinduction em- 
ployer with but one modification of any importance: in answer to 
the question, ‘‘Do you think you actually will go back to work for 
the same employer . . .?”’ the previously used response category, 
“TI may, but I’m not sure’’ was replaced by two new categories: “Yes, 
I probably will” and “I may, but I probably will not.” It was 
hoped that under conditions favorable to prediction from stated 
plans, this change would permit discrimination between one group 
of which more than half the members would actually return and 
another group of which somewhat less than half would return. On 
the assumption that errors of individual prediction would to a con- 
siderable extent counterbalance each other, the new categories gave 
a cutting point for a total percentage prediction as well as for indi- 
vidual prediction. The classification scheme, along with the two 
questions on which it was based, is given below. 


Question 24: Do you think you could get work with the employer you 
worked for before you came into the Army, if you wanted to? 


Question 26: Do you think you actually will go back to work for the same 
employer you worked for before you came into the Army? 


Classification Answer to question 24. Answer to question 25 
Definite plans to return Yes, I’m almost sure Yes, I’m almost sure 
I could I will 
Tentative plans to return Yes, I’m almost sure Yes, I probably will 
I could 
or 


Yes, I think so, but | Yes, I’m almost sure 
I’m not sure Yes, I probably will 


Considering returning Yes, I’m almost sure 


but not planning to I could I may, but I prob- 
Yes, I think so, but ably will not 
I’m not sure 


Not considering returning All other combinations 
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It was found that among July separatees at least one sixth of the 
men had entered the Army before the Selective Service System came 
into effect. These men, of course, tended to have the highest 
accumulation of points for demobilization. Somewhat less than 
two thirds of the July separatees had been gainfully employed at 
full-time jobs just prior to entering the service. Another 11 per 
cent reported that they had had part-time jobs before entering the 
service. Among December separatees, on the other hand, slightly 
over four fifths had been gainfully employed at full-time jobs and 
about 3 per cent had been employed part-time before entering the 
service. The preinduction employment experience of the Decem- 


TABLE 1 


CoMPARIS0N OF PLans oF Warrs Enuistep Mgn To RETURN TO 
Previous EmMPLormrk AMONG MEN IN SERVICE IN Summmr 1944 
AND MEN SEPARATED FROM SERVICE IN JULY AND 


DeEcumMBER 1945 
(Per cent) 
Separatees Separatees 
Men in service July December 
Summer 1944 1946 1946 
Total 100 100 100 
Had worked full time for an 
employer 80 64 82 
Definite plans to return (23) (16) (29) 
Tentative plans to return (26%) (7) (17) 
Considering returning (11) (13) 
Not considering returning (31) (30) (23) 
Had not worked full time for 
an employer 20 36 18 


* Wording of anawer category was such as to include all those considering returning but not definitely 
planning to do so. 


ber separatees approximated that of the total Army as revealed by 
earlier world-wide surveys. This was in accordance with expecta- 
tion, since they were being released about midway in the demobili- 
zation process and in most characteristics were fairly close to Army 
averages as of peak strength. 

Table 1 presents a comparison of the plans of white enlisted men 
surveyed in the world-wide cross examination of summer 1944, and 
those of July and December separatees at the time of discharge. 
Clearly, July separatees who had been employed full time just be- 
fore entering the service were planning to return to their former 
employers in lower proportion, and December separatees were plan- 
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ning to return in higher proportion than were enlisted men who had 
been surveyed in the summer of 1944. The first question to be 
answered, in evaluating the use of soldiers’ plans as predictors, was 
whether the differences reflected the effect of different characteris- 
tics within the three sample groups, or whether there had been real 
shifts in the plans of soldiers with similar characteristics. 

An analysis of characteristics associated with plans to return to 
one’s preinduction employer revealed that such plans were held in 
highest proportion among older men, those with relatively short 


TABLE 2 


Pians To Rerurgn To Previous Emp.oyer, BY DuraTION oF PREVIOUS 
EmPLoYMENT 
(July and December 1945 Separatees) 
(Per cent) 


LENGTH OF PREVIOUS EMPLOYMENT 


Less than 1-2 2-5 Five years 
1 year years years or more 
July Separatees 

Total 100 100 100 100 

Definite plans 12 22 31 49 
Tentative plans 9 11 12 16 
Considering returning 20 18 17 14 
Not considering returning 59 49 40 21 

December Separatees 

Total 100 100 100 100 

Definite plans 18 35 47 64 
Tentative plans 22 24 17 16 
Considering returning 17 19 13 13 
Not considering returning 43 22 23 7 


service in the Army, and those longest employed in the job held 
just before induction. The last-mentioned variable, however, was 
found to account for all of the correlation between age and plans to 
return and most of the correlation between length of Army service 
and plans to return.’ Even when groups of the same duration of 
employment are compared, however, December separatees con- 
sistently expressed plans to return to their previous employers in 
substantially higher proportion than did July separatees (Table 2). 
The same was found to be true when groups of identical age, dura- 
tion of Army service, and education were compared. 

* There was a considerably higher relationship between plans to return and length of 


Army service, duration of previous employment being held constant, among July sep- 
aratees than among December separatees. 
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Performance of Separatees 


In order to establish the relationship between plans expressed at 
separation and performance defined in terms of an attempt to carry 
out those plans, the follow-up questionnaire asked not only about 
current employment status but also about efforts to return to pre- 
induction employer. The question asked and the distribution of 
responses of July and December separatees are reported in Table 3. 


TABLE 3 


Responses oF JuLY aND DecemBER SEPARATEES TO QUESTION RELATING TO 
Return to PrReinpucrion Emp.orver 48 or Two To Four 
Monrss aFrar DiscHaRGE 


Question 6: Since you got your discharge, did you 


go back to work for the employer July December 
(company, person, etc.) you worked separatees separatees 
for before you entered the Army? (per cent) (per cent) 
Total 100.0 100.0 
I wasn’t working for an employer before I went 
into the Army 22.8 12.1 
Yes, I got a job with my old employer and I am 
working there now 19.3 38.2 
Yes, I got a job with my old employer but I am not 
working there any more 3.1 4.1 
I tried to get a job with my old employer, but 
couldn’t L.7 3.6 


No, I asked about a job but decided not to work 


there 4.1 6.0 
No, I haven't started working there yet, but I plan 

to go back with my old employer 3.6 4.3 
No, I didn’t try to go back and I don’t plan to 43.0 29.1 
No answer 2.4 2.6 


The extent of return of July and December separatees to work for 
their preinduction employers was very closely in accordance with 
predictions made on the basis of their plans. Thus 23 per cent of 
the July separatees had expressed definite or tentative plans to 
return, and as of two to four months after discharge 22.4 per cent 
had returned and another 3.6 per cent still planned to return, while 
1.7 per cent had tried to get their old jobs back but had been unsuc- 
cessful. In all, then, 27.7 per cent may be classified as having 
returned or tried to return. Again, among December separatees 
46 per cent had expressed plans indicating that they were leaning 
toward return, and within three to four months after discharge 42.3 
per cent had returned, another 4.3 per cent still planned to return, 
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and 3.6 per cent had tried to return but had been unable to do so. 
Thus, 50.2 per cent may be classified as having returned or tried to 
return. 

Actually, the accuracy of the prediction of return to preinduction 
employer was probably somewhat higher than indicated by the 
above figures. It appears that about 2 per cent of each group of 
separatees was comprised of men who reported that they had re- 
turned or had tried to do so, yet who had at the time of discharge 
indicated that they were not employed immediately prior to their 
induction. In classifying plans to return, it will be recalled that 
men not employed full-time immediately prior to induction were not 
included, since they presumably had no real claim to jobs.*® 

It will also be noted that answers to the category ‘I wasn’t work- 
ing for an employer before I went into the service” give a consider- 
ably lower estimate of the number not gainfully employed than was 
given by questions in the questionnaire administered at the separa- 
tion centers. (See Table 1.) Some men employed only part-time 
before induction answered the follow-up question in terms of their 
relation to their part-time employer. Some men not employed at 
all prior to induction checked “I didn’t try to go back” instead of 
checking that they had not been employed; others simply did not 
answer the question. All of which again indicates the problem of 
maintaining exact specification of dimensions in asking such ques- 
tions. 


Prediction for the Individual Separatee 


Among both July and December separatees there was a consistent 
relationship between plans classification and performance. The 
large majority of soldiers who had planned definitely to return actu- 
ally did so; a smaller proportion of tentative planners returned ; while 
a sizable minority of those considering returning and relatively few 
of those not considering returning actually did so. 

It has been noted that December separatees planned to return 
(and did return) to work for former employers in much larger pro- 
portion than did July separatees. It will be noted (Tables 4 and 5) 
that for every category of plans with respect to returning to prein- 
duction employer, December separatees returned in higher propor- 
tion than did July separatees. Whereas 82 per cent of the July 


5 An alternative method of prediction, which did not eliminate men who were not 
employed immediately prior to discharge, yielded estimates of 26 per cent and 47 per 
cent return for July and December separatees, respectively. 
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separatees with definite plans to return did go back to their old jobs, 
had attempted to go back, or were planning to do so, 85 per cent of 
the December separatees with definite plans to return had at least 
tried to carry out their plans or were planning to sustain them. 
Again, 11 per cent of the July separatees who originally had not 
planned to return to their old employers apparently changed their 
minds and at least tried to go back or were planning to do so, but 
among December separatees with comparable plans, 24 per cent 
returned, or attempted to do so, or were intending to return. 


TABLE 4 
Ra.ationsHir BETWEEN PLANS TO ReruRNn TO PREVIOUS EMPLOYER AND 
or Rerurn witsiIn Two To Four Montss aFrrer DiscHARGE 


(July 1945 Separatees) 
(Per cent) 


PLANS EXPRESSED AT SEPARATION* 
STATUS TWO TO FOUR 


MONTHS AFTER DISCHARGE Definitely Probably Possibly Plan not 
return return return to return 
Total 100 100 100 100 
Had returned to previous 
employer 71 44 26 7 
Planning to return 9 8 5 2 
Tried, could not get job 
back 2 3 2 2 
Asked, but decided not to 
return 6 6 8 5 
Made no effort to return 12 39 59 84 
Number of cases 296 181 206 5284 


* By men who had worked full-time for an employer just before entering the Army. 


There is clear evidence that December separatees were more likely 
to return, regardless of their plans. 

This fact would seem to reflect the differences in the labor market 
situation encountered by the two groups of separatees rather than 
marked differences in the personal characteristics of the separatees 
themselves. Among July separatees who at the time of discharge 
were somewhat undecided in their plans but were considering going 
back to their old jobs, a third did return to their former employers 
within two to four months after discharge. The other two thirds 
apparently found other prospects more attractive. But among 
December separatees in the same category as to plans, only about 
half found more attractive alternatives to returning. Indeed, on 
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the basis of performance the plans category “considering returning” 
for the December group came very close to being equivalent to ‘will 
probably return” for the July group. 

For both July and December separatees, the cutting point for 
prediction of returns had represented a lucky choice, and errors of 
prediction on the two sides of the cutting point almost exactly bal- 
anced each other. The relation of plans to individual performance, 
however, indicates that there was no sharp dividing line between 


TABLE 5 


RELATIONSHIP BETWEEN PLANS TO RETURN TO Previous EMPLOYER AND 
Extent oF Rerurnn witHin THREE TO Four Montss AFTER DiscHaRGB 
(December 1945 Separatees) 

(Per cent) 


PLANS EXPRESSED AT SEPARATION* 
STATUS THREE TO FOUR 


MONTHS AFTER DISCHARGE Definitely Probably Possibly Plan not 
return return return to return 
Total 100 100 100 100 
Had returned to previous 
employer 79 57 37 15 
Planning to return 4 6 8 3 
Tried, could not get job 
back 2 7 4 6 
Asked, but decided not to 
return 4 10 9 7 
Made no effort to return 11 20 42 69 
Number of cases §98 851 274 480 


* By men who had worked full-time for an employer just before entering the Army. 


plans categories but rather a gradation. If the war with Japan had 
not ended in August 1945, it is possible that considerably fewer July 
separatees would have returned to their former employers—and 
that the cutting point for prediction would have been just below 
the group with definite plans to return. Again, if the labor market 
had been any tighter for the December separatees, it is possible that 
more than half would have sought their old jobs and that the best 
estimate of total return would have included not only the men with 
definite or tentative plans to go back but also those who had said: 
“TI may return but probably will not.” 


The Adequacy of Plans as a Basis for Prediction 


In addition to the questions already described as the basis of clas- 
sification of soldiers’ plans to return to previous employers, the 
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“summing up” question toward the close of the questionnaire was 
also used as a basis for estimating the probable rate of return. The 
question, involving a final choice of alternatives, has already been 
quoted in connection with plans to return to school.’ The first 
answer category was “I will probably work for the employer I was 
working for just before I came into the Army.’”’ This question, it 
will be recalled, was followed by another which asked about the 
degree of certainty of the soldier’s plans. To be classified as having 
definite plans to return to work for one’s previous employer, by this 
scheme, a man had to check the appropriate category in the ques- 


TABLE 6 


CoMPARISON OF ALTERNATIVE Meruops oF CLASSIFICATION OF SEPARATEES’ 
Pians To Rerugn To Previous EMPLOYER 


Separatees Separatees 
Interest in returning to previous July 1945 December 1945 
employer (per cent) (per cent) 
First Method (direct questions) 
Definite plans to return "Sh ra 48 
Tentative plans to return 7 17 
Considering returning 11 13 
Other (including men who had 
not been employed) 66 4l 
100 100 
Second Method (final alternatives) 
Definite plans to return it 26 a} 47 
Tentative plans to return 11 20 
Other plans 74 53 
100 100 


tion involving alternatives (Q 68) and check “I am very sure”’ (that 
this is what I will do first of all after discharge) in answer to the 
question relating to degree of certainty. All other men who 
checked that they thought they would probably return to work for 
their preservice employer (but who were not “very sure’’) were 
classified as having tentative plans to return. A comparison of the 
proportions of men classified as having plans to return to their pre- 
vious employers, based upon the alternative methods of classifica- 
tion, is given in Table 6. 

It will be noted that predictions based upon the alternative meth- 
ods were closely similar—assuming that most men with definite or 


7 The full question is reported in Appendix A to this chapter. See questions 68 and 
69. 
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tentative plans would carry out those plans and that roughly com- 
pensating numbers who had not planned to return would actually 
do so. 

As would be expected, there was a high degree of consistency in 
classifications based on the two methods. Thus, of 1,713 Decem- 
ber separatees who had worked for an employer just before entering 
the Army, the same prediction—whether return or nonreturn—was 
given by the two methods in 1,489 or 87 per cent of the cases. (See 
Table 7.) 


TABLE 7 


Extmant oF ConsisfENcy oF Prepicrion Usina ALTERNATIVE MxrHops oF 
Pians CLASSIFICATION, AND EXTENT OF RETURN TO 
Previous Emp.orer ror Eace 


(December Separatees) 

Prediction based Prediction based Per cent 
on original method on second method Number of who actually 
of classification of classification separatees returned* 
Return Return 841 83 
Return Nonreturn 118 49 
Nonreturn Return 106 60 
Nonreturn Nonreturn 648 26 


* Or planned to return or had tried to return as of three to four months after discharge. 


The two groups of men whose expression of plans was inconsistent, 
by the criterion of alternative classification, tended to counterbal- 
ance each other and to return to their previous employers in reason- 
ably similar proportions. 

Of the errors in prediction of return, only about a third came from 
the two rather small groups which appeared inconsistent in the ex- 
pression of their plans. The other two thirds came from the larger 
groups who had consistently reported plans to return or consist- 
ently reported plans not to return. It would appear, therefore, 
that while manifest uncertainty of plans expressed at the time of 
discharge was a significant source of error in prediction, the modifi- 
cation of plans after discharge was probably of greater importance 
as a source of predictive error. 


Population Characteristics and Prediction of Return 


As has already been mentioned, the most important characteristic 
associated with plans to return to preinduction employer was the 
length of time a man had been with that employer. In addition to 
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its relationship to plans expressed at the time of discharge, duration 
of previous employment was also found to affect the extent of 
actual reiurn by men who expressed similar plans. That is, among 
men with definite plans (or tentative plans, or no plans) to return 
to their previous employer, those longest employed went back in 
higher proportion than those employed for a shorter period (Table 
8). 
TABLE 8 


Rats or Return To Previous EmMPpLover PER HUNDRED SEPARATEES REPORTING 
SrEctFIED PLans To RErukN AND SpPEcIFIED DURATION OF PREVIOUS 
EMPLOYMENT 


(July and December 1945 Separatees) 


RATE OF RETURN BY PLANS EXPRESSED 


DURATION OF PREVIOUS AT SEPARATION 
EMPLOYMENT 
Definitely Probably Possibly Plan not 
return return relurn to return 
July Separatees 
Less than one year 70 } 50° } 31 8 
One year up to two years 79 12 
Two years up to five years 79 61 41 14 
Five years and over 89 
Average rate for plans group 
(from Table 4) (82) (55) (33) (11) 
December Separatees 
Less than one year 76 63 41 21 
One year up to two years 83 72 56 23 
Two years up to five years 86 
Five years and over 93 ” 53 29 
Average rate for plans group 
(from Table 5) (85) (70) (49) (24) 


* Combinations made where single classes contained fewer than fifty cases. 


For both July and December separatees, tabulations to show ex- 
tent of return, by length of Army service, by age and by education, 
holding constant plans and length of previous employment, re- 
vealed no significant relationships. That is, among groups of com- 
parable plans and comparable duration of employment, older men 
were no more likely to return than were younger men, men in the 
Army less than three years were neither more nor less likely to re- 
turn then were those in the Army for longer periods, high school 
graduates and college men were no more likely to return than were 
soldiers whose education had not extended beyond grade school. 
There was a slight tendency for those high school graduates who 
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had not planned to return to their previous employers to stand by 
their plans (i.e., not to return) to a somewhat greater degree than 
was the case among men with only grade school education, but this 
was the only group for which rate of return seemed to be related to 
education. 


Type of Employer and Rate of Return 


The questionnaire administered at the separation centers had 
asked for a report of the type of employer for whom the men had 
worked in their last civilian jobs—whether a private employer with 
fewer than fifty employees, a private employer with fifty or more 
employees, or a branch of Federal, state, or local government. An 
analysis of the extent of return to previous employers, by type of 
employer, of December separatees revealed that 40 per cent of the 
men who reported they had worked for small private employers, 61 
per cent of those who reported they had worked for larger private 
employers, and 52 per cent of those who said they had been govern- 
mentally employed® actually returned to their former employer. In 
part, these differences reflect a difference in the plans of the three 
groups. Even with duration of previous employment held constant, 
plans to return to their preinduction employer were more frequently 
expressed by men who had worked for larger companies than by 
those who had worked for companies or individuals employing less 
than fifty persons. Part of the greater degree of return by the em- 
ployees of larger companies, however, seems to be independent of 
plans expressed at separation. Consistently higher rates of return 
were noted for the employees of larger companies when plans and 
duration of previous employment were held constant.° 

Of men who had worked for more than three years for companies 
employing fifty or more workers, nearly four fifths planned to return 
and a slightly higher proportion actually returned. Only three 
fifths of the men who had worked for smaller companies for a com- 
parable time planned to return, and the same proportion actually 
returned. It is interesting to note that a number of industrial 


* The self-reporting of governmental employment is known to contain a large element 
of error because of misconceptions held by many soldiers as to the scope of such employ- 
ment. Thus many classified work in plants with war contracts as federal employment. 

* Because of the small number of cases involved when comparing employees of large 
and small companies, matching length of preservice employment and plans to return, 
the significance of the differences between matched pairs was tested by the use of the 
statistical signs test after eliminating all pairs in which either member contained leas 
than twenty cases. In six out of six pairs included, the rate of return was highest for 
employees of larger companies. 
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giants have reported the return of more than 90 per cent of their 
former personnel who entered the service."° The number of gov- 
ernmental employees involved was too small to permit the use of 
controls adequate to support any generalizations in this respect. 


Occupation and Return 


Inquiry into the occupational plans of soldiers surveyed in the 
summer of 1944 revealed that those men who had been engaged in 
occupations of higher prestige and skill requirements were more 
likely to plan to return to the same general field after the war than 
were those whose jobs had been at lower levels of prestige or skill. 
Thus 54 per cent of the skilled workers, 42 per cent of the semiskilled 
workers, and 32 per cent of the unskilled workers expressed plans to 
return to the same kind of work after the war (regardless of whether 
for the same or a different employer). Data on preinduction occu- 
pation were coded for July separatees but not for December sepa- 
ratees. For the July group it was found that skilled workers 
planned to return and did return to their preservice employers in 
slightly higher proportion than did semiskilled or unskilled workers, 
but no differences in plans or in performance were noted between 
the latter two groups. Moreover, all of the excess in rate of plans 
and of return on the part of the skilled workers appears to be asso- 
ciated with the fact that, by and large, this group had been longer 
employed by their preservice employer. The number of cases in- 
volved in other occupational groups was too few to permit adequate 
analysis of possible differences in rate of return. 


The Situation Encountered after Discharge 


Mention has already been made of the fact that December sepa- 
ratees encountered a markedly different employment situation than 
had July separatees. Unemployment had been very slight in July; 
by January 1946, it had passed the two million mark. In July less 
than 200,000 men were discharged; in December more than a million 
left the services to compete for places in the labor market. Between 
the two groups of separatees, the rate of entrance into jobs was con- 
siderably slower for the December group. Among those who had 
taken jobs, in the first month after discharge, job turnover was lower 
for the December group. Again, among December separatees the 
percentage who reported that they tried to get a job with their 


1% See, for example, the report of the Standard Oil Company of New Jersey in the 
pamphlet ‘‘A Generation of Industrial Peace’ (1946), p. 5. 
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former employers but were unable to do so was considerably greater 
than the percentage of July separatees giving this report. 

At the time of discharge, December separatees were to a consider- 
able degree aware of the situation which they faced. The shift in 
their plans, as compared with those of July separatees, attests their 
awareness. But the data on their performance, for groups of com- 
parable plans (Tables 4 and 5) suggest that those who found they 
could return, whether or not this was their primary desire, were 
more often interested in the security of a job than the July separatees 
had been. Again it must be emphasized that although the predic- 
tions made on the basis of soldiers’ plans turned out to be surpris- 
ingly accurate for the two sample groups, a markedly different set 
of conditions after discharge might have altered greatly the actual 
rates of return. 


SECTION II 


SOLDIERS’ PLANS FOR FARMING! 


About 10 per cent of the men who enlisted or were drafted into 
the Armed Forces from the onset of Selective Service were employed 
in agriculture just prior to their enlistment or induction. But 
nearly another fifth of all the men in service had had at least a year’s 
experience, full-time, at working on a farm. Many had left the 
farm years before the war emergency, as part of that cityward flow 
of man power which has gone on since the first cities were estab- 
lished. The city offered jobs, excitement, freedom, as contrasted 
with the lack of opportunity in many rural areas where the annual 
increment of infants was larger than the increment of job openings, 
where life moved at a slower, more even pace, and where the like- 
mindedness and community solidarity of the adult population 
seemed sometimes to preclude the possibility of enjoying ‘‘modern” 
pleasures. Other men had shifted from working on a farm only 
when the war emergency gave rise to the tremendous boom in fac- 
tory employment and wages. 

The war reduced the man power available to operate America’s 
farms by a million and a half, but older men, women, and adoles- 
cents stepped into the breach to the extent that, with improved 
farm practices, the agricultural yield during the war years was 
greater than ever before. Moreover, it was estimated that the 


The main findings of the report Soldiers’ Plans for Farming after They Leave the 
Army bave been presented in Chapter 13 of Volume II of this series. 
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number of farm youth becoming available to operate or work on 
the farms would be more than enough to replace all of the older men 
dying in the next ten years. With improved technology, it was esti- 
mated that 300,000 fewer agricultural workers than were on hand 
in the war year 1943 could produce all the farm products which 
would be needed. !? 

In view of these facts, it was important to know what effect mili- 
tary service would have with respect to the attractiveness of farm- 
ing as an occupation. Would men without farming experience 
develop a hankering for the quiet life of the farm where they might 
be free from regimentation? Would there be a tremendous demand 
from veterans for farm land? Several journalists painted pictures 
of city veterans flocking into farming—a trend which, if it materi- 
alized, could only result in disillusionment for many of those not 
trained in modern farming methods. 


Ascertaining Plans for Farming 


Most of the pretest work with respect to ascertaining soldiers’ 
plans to farm was devoted to getting at the details of such plans— 
whether men planned to farm as operators, family workers, or wage 
hands, the general type of farming they planned to do, etc. Al- 
though some inconsistency was noted in comparing answers to sev- 
eral questions dealing with the general classification of plans to farm 
rather than to enter some other field, the problems here encoun- 
tered did not seem as acute as those in fields like government em- 
ployment or self-employment in nonagricultural enterprises. Per- 
haps the reason for this is that relatively few soldiers who did not 
have a background of farming experience expressed serious interest 
in an agricultural career. 

Basic classification of plans to farm in all major surveys on this 
subject utilized responses to the write-in question and a direct check 
list contained several pages later in the questionnaire: “Do you 
think you will do farming when you get out of the Army?” This 
direct question on plans to farm was preceded by two questions on 
previous farming experience and followed by a battery of questions 
on the detailed plans of those men planning to farm. (See Appen- 
dix A, questions 48-58.) 

The cross-sectional survey conducted in the summer of 1944 found 
approximately 10 per cent of the white enlisted men to be seriously 


12 See Carl C. Taylor, ‘The Veteran in Agriculture,” Annals of the American Academy 
of Political and Social Science, Vol. 238 (March 1945), p. 50. 
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considering full-time farming as an immediate postwar job plan. 
Included in this 10 per cent were 7.7 per cent who expressed definite 
and consistent plans in answer to both the write-in and the direct 
check-list question," 1.0 per cent who checked that they were “‘al- 
most sure” they would farm yet mentioned in the write-in that they 
were considering an alternative plan, and 1.4 per cent whose write-in 
answer mentioned farming but who checked “I may do some farm- 
ing, but I’m not sure.” Clearly these are not scale classes, but it 
seemed probable that those expressing definite plans would sustain 
those plans in highest proportion and that those who checked “TI 


TABLE 9 


ATTITUDE TOwARD MaxkinG Farmina Toem Lirswork aMona MEN 
Consiperine Foir-tiwe Farsmine as Postwar Occupation 


(Summer 1944) 


Plan definitely Think they will Say they 


Altitude toward farming to farm probably farm may farm 
as lifework (per cent) (per cent) (per cent) 
Total 100 100 100 
Expect to make farming life- 
work 73 51 13 
Expect to work on farm but 
might change later 24 39 50 
Expect to work on farm only 
if cannot get other job 1 6 26 
No answer to question* 2 4 11 
Number of respondents 1,190 1486 221 
Per cent of total sample 7.7 1.0 1.4 


* Question was included in block to be answered only by men planning to farm. A high proportion of 
failures to answer the question is thus indicative of a lack of plans to farm. 


may do some farming” would be least likely of the three groups to 
sustain their plans. The chief reason for this expectation was the 
finding of the way in which members of the three groups responded 
to this question : ‘Do you think you will make farming your lifework 
or do you expect to change to some other kind of work later on?” 
As will be seen in Table 9, nearly three fourths of the men who had 
expressed definite plans said they expected to make farming their 
lifework, as against half of those who mentioned that they were con- 
sidering alternatives but thought they would probably farm, and 
an eighth of those who checked merely “I may do some farming.” 


4% To this group were added men who failed to answer one of these questions or the 
other but who otherwise consistently expressed plans to farm. 
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The three groups also differed in the proportion having had at least 
a year of full-time farming previous to Army service. Among those 
with definite plans, 86 per cent had had at least a year of full-time 
experience while for the other two groups the figures were 78 and 66 
per cent respectively. (Only 2 per cent of those with definite plans 
were wholly lacking in farming experience.) 

Of particular interest in this part of the study of soldiers’ plans 
was the specific type of farming arrangement planned. A majority 
of those employed on farms before induction had either worked on 
family farms or as wage hands, and only a third reported that they 
had owned or rented a farm (for cash or for a share of the crop). 
Among those with definite plans to farm, however, 70 per cent said 
they planned to operate a farm which they would own or rent. As 
might have been expected, moreover, those who had operated farms 
were much more likely to plan to return to farming than were the 
former wage workers on the farms. 

It was found that men who said, in answer to one question, that 
they had in mind a farm which they planned to buy or rent, fre- 
quently indicated elsewhere that they planned to work on a family 
farm or for wages immediately after the war. In other words, they 
aspired to become farm operators but would be content for a while 
to work on someone else’s farm. Other men indicated that if they 
could not rent or buy a farm they would not be interested in farm 
employment. Thus the estimation of the number of men who 
would actually farm, in a sense, required several predictions. A 
quotation from the report of the 1944 survey poses the problem: 

“Prediction of the number of men who will actually farm cannot 
be made solely on the basis of the men’s intentions of farming. The 
intentions of the men at this time may be relatively unimportant as 
determinants of later action unless certain conditions are met after 
the war. Among these conditions the following may be considered 
as of major importance: (1) there must not be too great a differential 
between opportunities open in agriculture and those offered by in- 
dustry ; (2) there must be farms available for those men who will go 
into farming only if they can be operators; (3) men counting on 
loans guaranteed by the government under the GI Bill of Rights in 
order to finance farm operation must actually be able to qualify for 
such loans.” 

The problem of estimating how many soldiers might return to 


\“ Information and Education Division Report No. B-131, Soldiers’ Plans for Farm- 
ing after They Leave the Army, p. 32. 
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farming after the war was also complicated by the fact that the use 
of a questionnaire which was filled out by the respondent underrep- 
resented a segment of the Army population containing a high pro- 
portion of potential fariners—the illiterate. In surveys of plans of 
Negro troops, illiterates were systematically interviewed, but among 
white troops it was estimated that illiterates comprised only about 
2 to 4 per cent and therefore they were simply excluded from con- 
sideration. In the present instance, however, it was estimated that 
roughly half of the illiterates were prospective farmers and farm 
laborers. In preparing estimates of the total number of soldiers 
likely to go into agricultural work after the war, allowance was 
made for the existence of the unrepresented illiterates, but the effect 
of their presence in the body of prospective farmers was unfortu- 
nately not reflected in the data on the details of plans for farming. 


Postseparation Performance 


Data on the proportion of separatees who had gone into agricul- 
tural work shortly after discharge were secured only for July sepa- 
ratees. Nine per cent of the July separatees reported that they had 
been farmers or farm workers just before entering the service, and a 
little less than 8 per cent were seriously considering returning to 
farming. As of two to four months after discharge, however, only 


TABLE 10 
RELATIONSHIP BETWEEN PLANS TO FARM AND POSTSEPARATION PERFORMANCE 


(July 1945 Separatees) 
(Per cent) 


STATUS TWO TO FOUR MONTHS AFTER DISCHARGE 
PLANS EXPRESSED AT TIME 


OF SEPARATION Total Farming Not farming 
Total 100.0 6.5 98.6 
Definite plans to farm 4.7 2.8 19 
Tentative plans to farm 2.4 1.0 1.4 
Vague plans to farm 8.1 1.2 6.9 
No expressed plans to farm 84.8 1.5 83.3 


6.5 per cent reported that they were engaged in farming. For two 
reasons, it is likely that this figure does not accurately reflect the 
actual extent of return of July separatees to farming, however: (1) 
follow-up responses were secured from most men in October, when 
agricultural employment would have declined somewhat, and (2) 
some men were postponing their return to farming until the follow- 
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ing spring. Nevertheless, it is instructive to consider the relation- 
ship between plans and performance in so far as data are available 
(Table 10). Roughly three fifths of those men who had expressed 
definite plans to farm actually carried out their plans in the first 
few months after discharge, as did two fifths of the men who had 
expressed tentative or not wholly consistent plans to farm. To- 
gether these two groups contributed somewhat more than half of 
the men who were farming after discharge. Nearly all of the men 
who were farming had had some previous experience in this field and 
nearly two thirds of them said they had worked full-time on a farm 
for a year or more. This helps to explain the rather large group 
who reported in the follow-up that they were farming even though 
they had previously checked ‘‘No, I’m almost sure I won’t farm.”’ 
Most of these men were either working on their family’s farm or 
were working for wages on someone else’s farm—perhaps having 
taken, temporarily, at least, the first available job for which their 
previous experience fitted them. 

In general, holding constant the basic classification of plans, it 
appears that men with at least a year of full-time experience at 
farming, men who said they wanted to make farming their lifework, 
and men who planned either to work on their family’s farms or to 
operate their own farms were more likely to carry out their plans 
than were men with less experience, those less enthusiastic about a 
life career in farming, and those who had planned to work as hired 
hands. This latter finding is of particular interest since it had been 
hypothesized that those planning to operate farms might not be 
willing to return to agriculture if they could not manage to buy or 
rent a farm. Actually, many of these prospective operators were 
working on family farms or for wages, but they were working to- 
ward the goal which they had specified in their statements of plans. 

A comparison of the extent to which either one of the classifying 
questions alone would have permitted a correct prediction as to 
whether or not the individual separatee would go into agriculture 
after discharge revealed that the simple mention of farming on the 
write-in question more often gave a correct prediction that the man 
would farm than did the checking of the category ‘‘Yes, I am almost 
sure I will farm full-time” in answer to the direct check-list ques- 
tion. It is possible that some of the experienced farmers who 
leaned toward returning to agriculture after the war, yet who were 
considering alternative plans, were led into greater and greater un- 
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certainty as they proceeded through the questionnaire. The fact 
that many of the men planning to farin had had less educational 
opportunity and hence encountered more difficulty than other men, 
undoubtedly explains some of the inconsistencies of responses to the 
questions on their plans. 


SECTION III 


SOLDIERS’ PLANS FOR BUSINESS OWNERSHIP 


Prior to any follow-up of separatees, the following hypothesis was 
jotted down for future evaluation: ‘The proportion of men who 
carry out their plans, or attempt to carry them out, will be highest 
among those planning definitely to return to their previous employ- 
ers and lowest among those planning to start businesses of their 
own. More generally stated, the proportion of men who carry out 
or try to carry out their plans will be directly related to the amount 
of past experience and knowledge the men have in the area of their 
plans.” This hypothesis was simply a common-sense statement, 
and one which was well borne out by the results of the follow-up 
studies. In the case of soldiers’ plans for businesses and for govern- 
mental employment, plans were often based upon wishful thinking 
and upon misconceptions and were less highly predictive for the 
individual respondent than were plans in the areas already described. 
The reasons for this will be described in detail in the following 
sections. 

Survey data indicated that nearly 90 per cent of the Army’s 
male personnel entered the service from school or from jobs which 
they had held asemployees. Only about 8 per cent of a large cross- 
sectional sample reported that they had been self-employed before 
entering the service, and nearly half of these had been engaged in 
farming, chiefly as tenants. Thus less than 5 per cent had been 
engaged in small businesses or other nonagricultural self-employ- 
ment. Nevertheless, the traditional American goal of independ- 
ence in a small business enterprise was strongly planted in the 
consciousness of many of the men in service. And when the Serv- 
icemen’s Readjustment Act of 1944 offered, or seemed to offer, the 
prospect of a loan for getting started in business to any veteran who 
wanted to go into business, many servicemen began to view their 
dreams as plans for their postwar world. Fed-up with taking 
orders from others of higher rank in the service, not a few soldiers 
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expressed an aversion to returning to jobs where they would again 
be “outranked.”’ 

But dreams and intentions do not automatically lead to their own 
materialization. Many soldiers with a dream actually had little 
more than that dream as a basis for building a successful business. 
A large proportion, perhaps a majority, of those who were thinking 
about a postwar business could not be considered good enough risks 
to secure loans under the Federal program of aid to veterans. No 
one who interviewed them in order to check the realism of their 
thinking and planning would predict that most of these men would 
actually get started in businesses of their own in the immediate 
postdemobilization period. 

One of the most troublesome problems encountered in working 
with statistics on small businesses relates to the matter of defining 
a business. Common usage is to classify as self-employed all per- 
sons whose employment and income rests upon their own enterprise 
and who are neither regular wage, salary, or commission workers 
nor unpaid family workers. In theory, at least, the self-employed 
may further be classified as own-account workers or as employers. 
All of the employers may be said to operate businesses (if profes- 
sional practices are considered business enterprises) but common 
usage would not consider all of the own-account workers as proprie- 
tors of businesses. For example, the odd-jobs man is an own-account 
worker but hardly a proprietor. The picture is enormously com- 
plicated, however, by the fact that many individuals shift back and 
forth between own-account work, employment of others, and em- 
ployment by others. As of any moment in time one can presumably 
classify all persons in a particular field, although if seasonal fluctua- 
tions are great, such classification may not represent the most usual 
status of the persons classified. In attempting to predict for the 
future, the problem is even greater. Painters, carpenters, truck 
owners, landscape gardeners, artists, and members of other occupa- 
tional groups often indicated that they planned to seize whatever 
opportunities came for self-employment, yet might at times work 
for larger enterprises in their fields. 

The number of employers and own-account workers in nonagri- 
cultural enterprises as enumerated in the 1940 Census of Population 
exceeded the estimated number of business firms (based upon data 
from the Census of Business, Bureau of Internal Revenue, various 
state governmental agencies, trade associations and other sources) 
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by about one million.'® This excess is a reflection both of partner- 
ships—two or more proprietors to the firm—and of the fact that 
many own-account workers did not maintain a recognizable place 
of business and therefore were not enumerated as firms. 

In attempting to ascertain soldiers’ plans for businesses, the basic 
criteria for categorization of a business were the mention of self- 
employment, a brief description of the work planned, and the sol- 
dier’s checking “‘I plan to have a business of my own.” In general, 
interest was here focused upon the soldier who planned self-employ- 
ment which would qualify for consideration of a loan application 
under the provisions of the Servicemen’s Readjustment Act. Own- 
account workers would in most instances qualify. It is apparent, 
then, that the fuzzy edges of the concept of “‘a business” added 
appreciably to the problem of ascertaining plans to enter into such 
enterprises. 

Among the prospective users of research in this field there was 
some interest in the level of aspiration with respect to business, but 
for the most part, the government agencies charged with adminis- 
tering the loan program and with allocating the disposition of sur- 
plus property wanted to know how many veterans were likely to 
apply for aid and how many might be qualified to receive such aid. 

Nowhere in the series of studies of soldiers’ postwar plans was 
more difficulty encountered in presenting the data on soldiers’ plans 
in such a way as to prevent wholly unfounded conclusions than in 
the reporting of plans for business ownership. 

The section that follows will take up first the steps in the develop- 
ment of a set of questions to give a classification of plans for busi- 
ness ownership which would minimize the effects of suggestion and 
wishful thinking, and, second, the way in which the data on plans 
were utilized to give qualified predictions. 


Ascertaining Soldiers’ Plans to Own Businesses 


A pretest in the spring of 1944 contained the following question 
and yielded the responses indicated : 


18 Compare United States Census, 1940, Population, Vol. 3, Introduction, page 6, and 
Table 78, with estimates by Howard R. Bowen in “Trends in the Business Population,” 
Survey of Current Business, Vol. 24, No. 3 (March 1944), pp. 8-13. Additional prob- 
lems of definition and classification in this field are discussed in another article by Bowen 
et al., ‘New and Discontinued Businesses 1940-1943,” Survey of Current Business, 
Vol. 24, No. 7 (July 1944), pp. 7-18. 
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Do you expect that you will sometime start a business of your own? 


I already have a business of my own 6% 
——_—. Yes, I plan to start a businees of my own as soon as I 
get out of the Army 16 
—__.. Yes, I plan to start a business of my own but not as 
soon as I get out of the Army 
No, I don’t plan to start a business of my own 35 
No answer 3 
100% 


The question is obviously not adequate to get at specific plans, 
but it serves to indicate the high level of aspiration for proprietor- 
ship of a business among the soldiers surveyed. Interviewing re- 
vealed that many men who checked that they planned to start 
businesses immediately after discharge had no real plans but only 
hopes. Further, it was found that some men who checked “TI al- 
ready have a business of my own” had misinterpreted the question ; 
they were merely reporting that they had a regular trade or line of 
work which they planned to return to. It also appeared that a 
majority of the prospective farmers were interpreting business to 
include farm operation. Thus the task of ascertaining serious plans 
for proprietorship of a nonagricultural business enterprise was 
clearly one beset with problems. 

The use of a question presenting alternatives diminished consid- 
erably the number of men who indicated plans for a business, but 
even when self-employment was chosen over school and over work- 
ing for an employer, there was evidence that carefully considered 
plans for business ownership were still greatly overstated. The 
use of a write-in question at the very start of the series of items on 
postwar plans seemed to be the only way of cutting down the expres- 
sion of plans for a business to a group that for the most part would 
be similarly classified by an interview. 

The major survey in the summer of 1944 used responses to the 
following questions to classify soldiers’ plans for businesses: 


28. (a) What kind of work do you think you will do right after the war? 
Write the name of the job and describe it as fully as you can. 
If you are undecided, tell what you think you mighé do. 
(Four lines were left for description of plans) 


32. Do you expect to work for yourself or for someone else right after you 
leave the Army? (Check one) 

I expect to work for myself 

I expect to work for the government 

(Federal, state, city, etc.) 
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I expect to work for a relative (father, uncle, etc.) 
I expect to work for some other employer 
(company, person, etc.) 

I plan to go to full-time school 

I plan to atay in the Army 

I don’t have any definite plans 


35. Do you plan to start a business or run a farm" soon after you leave the 
Army? (Check one) 

No, I don’t plan to start a business or run a farm 

I already have a business or farm 

~_—.— Yes, I am almost sure I will start a business or run a farm 

——_— I may, but I’m not sure 


Although 12.9 per cent of the men answered the direct question 
(Q 35) by checking “‘I am almost sure I will start a business” (pro- 
spective farmers having been eliminated), only 8.7 per cent also said 
they planned to be self-employed right after leaving the Army, and 
only 5.3 per cent had also indicated definite plans for a business 
when first asked what they planned to do. The extent to which 
insistence on consistency in all three questions cut down the size of 
the groups who indicated “plans” for business in answer to the di- 
rect question is shown in Table 11. Some of the men who had said 


TABLE 11 


Exrent or Rgpucrion or EsriMaTEs oF PLANS FOR 
NONAGRICULTURAL OWNERSHIP® USING a8 CRITERION 
ConsisTENCY ON A THREE-QUESTION BATTERY 


(Per cent) 
Almost sure Do not plan 
Successive Already own Iwill start May start to start 
classification a business a business a business @ business 

Total responses to direct 

question (Q 35) 3.0 12.9 176 66.8 
Question with alternatives 

(Q 32): 

Expect to be self-em- 

ployed 2.6 8.7 3.1 
Write-in question (Q 28a): 

Definitely plan business (2.1) (5.3) (0.7) 

Considering starting busi- 

ness (0.3) (1.4) (0.7) 

No mention of business (0.2) (2.0) (1.7) 


* Men planning to farm were eliminated from the tabulation. 


‘6 Men planning to farm were eliminated by the use of an additional question which 
asked, ‘‘What kind of business or farming will you go into?” 
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they already owned businesses were actually planning to share 
management of family businesses after the war; others had been 
self-employed in small-scale enterprises and thought of themselves 
as owning “‘a business’ even though the enterprise was not operat- 
ing while they were in service and in fact its reopening after dis- 
charge was a matter of some doubt. 

Taking only those men who were completely consistent and free 
of doubts in the expression of their plans to own businesses after dis- 
charge, it will be noted that a little over 7 per cent would thereby 
be classified as definitely planning a business. A crucial aspect of 
this system of classification was the adequacy of coding of the write- 
in question. Recalling that in a cross section of enlisted men 
roughly a third had no education beyond grade school, it will come 
as no surprise to learn that the responses to the write-in question 
were not always literate or even legible, and in a considerable pro- 
portion of cases they were incomplete or ambiguous. Perhaps the 
surprising thing is the care that was taken by the great majority of 
the men to answer the question to the best of their ability. In many 
instances, of course, help was provided by the Research Branch 
representative administering the questionnaire. Nevertheless, cod- 
ing sketchy or ambiguous responses constituted a major problem 
both from the standpoint of accuracy and from that of time re- 
quired. Where the write-in question was ambiguous—for example, 
when the respondent wrote merely “trucking” as the description 
of his postwar job plans—coding clerks were instructed to examine 
the entire questionnaire for clues as to the employment status 
planned—whether the man expected to drive a truck owned by 
some trucking concern or whether he planned to operate a trucking 
business. Among the clues, of course, were answers to the check- 
list items dealing with self-employment and business ownership, so 
that coding of the write-in was in a small number of instances not 
independent of the other items used in the classification of plans 
for a business. Although all coding was verified and a high degree 
of consistency of classification was obtained, the complexity of the 
coding operation and the inevitability of differences in interpretation 
and hence of classification led to a considerable amount of interview- 
ing and experimentation with various alternative approaches to 
ascertaining plans. 

A series of pretests was carried out in order to secure a set of 
questions which would permit simpler classification of plans (espe- 
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cially through elimination of involved coding processes) yet at the 
same time would minimize suggestion. The first of these pretests 
involved the use of three forms: one in which the soldier was asked 
first what he would like to do and then what he actually expected 
to do; a second in which the sequence of questions on postwar plans 
was introduced by the presentation of a check list of alternatives 
and a write-in question carried further on was designed merely as a 
source of data for occupational classification; and a third form in 
which the sequence of questions on plans was introduced by a full 
paragraph describing in detail the kind of information wanted in a 
general write-in description of plans. 

It had been hoped that by first asking what the soldier would like 
to do after the war, and subsequently asking for his evaluation of 
the likelihood that he would actually attempt to carry out his aspira- 
tion, it might be possible to lead the respondent to a realistic con- 
sideration of his future prospects and through this to a reasoned 
statement of plans and expectations. Upon pretesting this form, 
however, it was found that many men who had thought through 
their plans were irritated by the implication that they might not be 
able to do what they would like to do, others were confused by the 
sequence, and still others with high aspirations sailed blithely along 
regardless of the questions designed to make them consider the 
problems to be encountered. Because of the tendency for this 
approach to irritate or to confuse, it was dropped from further con- 
sideration. 

The other two pretest forms seemed to work well enough in so far 
as soldier reaction was concerned but gave somewhat different esti- 
mates of soldiers’ plans. The number of cases involved was too 
small to establish the statistical significance of the differences, so a 
larger test, involving further development of the two forms, was 
then set up. 

The primary objective of this second pretest was to establish 
whether a set of check-list questions which did not depend on a 
write-in for classification of plans would give the same estimates as 
a form in which the initial write-in question was of basic importance. 
Since the use of the write-in question was most important as a means 
of evaluating check-list reports of plans for a business, principal 
interest in the pretest was focused on classification of plans within 
this area. 

The two forms were administered interleaved, the basic samples 
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comprising 1,085 cases for Form A and 1,088 for Form B. The 
distributions by age and education of respondents to the two forms 
were almost identical. 

Form A first presented a check list of alternatives of employment 
status; Form B led off with a lengthy question asking for a full de- 
scription of the soldier’s plans, and followed this with a check-list 
question presenting alternatives of employment status. A com- 
parison of the distribution of responses to the two similar but not 
identical check-list questions presenting alternatives of status sug- 
gests that responses to the question in Form B were definitely in- 
fluenced by the preceding write-in question.” (See Table 12.) 

Cross tabulation of the questions involving a choice of alterna- 
tives with the question relating directly to plans for business owner- 
ship cut down the percentage classified as planning to own a busi- 
ness from 25 per cent to 20 per cent using Form A and from 20 per 


17 The two questions are given below: 


Form A 
23a. You will probably want to take a vacation first of all after you get out of 
the Army. But after that, which one of the things below do you think you 
are most likely to do? (Check only one—the thing you think you are most 
likely to do FIRST, even if you should plan to change to something else after- 
ward.) 
Work for an employer, on salary, wages or commission 
Go to full-time school or college 
Farm for myself 
Have my own business 
—__— Stay in the Army 
I’m undecided what I will do 
Form B 
27. What type of employer do you expect to work for right after you leave the 


Army? (Check one) 


T expect to work for a branch of the government (Federal, state, 
etc.) 
I expect to work for some other employer (company, person, 
agency, etc.) 
I expect to farm for myself 
T expect to have my own business 
T plan to stay in the Army 
T plan to go to full-time school 
I haven’t decided what I will do 
Unfortunately the differences in the wording of the two questions presenting salterna- 
tives are substantial; even though many of the response categories are almos _ identica! 
it is likely that a part of the difference between the distribution of responses was oc- 
casioned by wording as well as by placement. Evidence that the write-in question led 
to a more careful consideration of plans is given by the fact that in answering the direct 
check-list question on plans for a business, respondents to Form B expressed such plans 
in significantly lower proportion than did respondents to Form A. It seems unlikely 
that this effect would have been caused by the different focuses of the two questions 
presenting alternatives, nor can it be accounted for in terms of differences between the 
two sample groups, since they were almost identical as to age, education, and preinduc- 
tion status. 
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cent to 15 per cent using Form B. Use of the write-in question of 
Form B to establish complete consistency in the expression of plans 
for business ownership further reduced the proportion classified as 
having such plans to 11 per cent. Another 2 per cent of the re- 
spondents to Form B indicated such plans in answer to the check- 
list questions, but their write-in answers were not clear as to the 
employment status planned. 

This pretest established rather clearly the need for starting off 
the sequence of questions with an item which would require some 
pause for consideration before giving an answer, and confirmed ear- 


TABLE 12 


ComPaRISON OF Responsss To Sisazak Cxaeck-List Questions PRESENTING 
ALTERNATIVES OF EMPLOYMENT Status WHEN ASKED 48 INITIAL 
Question In Sequance (Form A) aND WHEN ASKED FoLLowINa 

A GxnpraL Warits-in (Form B) 


(Per cent) 
Form A Form B 

Total 100 100 

Work for an employer 38 47 
Attend full-time school 12 ll 
Have own business 25 20 
Farm for self (operator) 9 8 
Stay in Army 3 2 
Undecided 12 8 
Other and No answer 1 4 
Number of cases 1,085 1,088 


lier judgments that an open-ended question would be less likely to 
lead the respondent into choice of an area of aspirations rather than 
of plans. It also demonstrated again rather clearly the difficulty of 
basing classification on responses to an open-ended question when 
such responses were frequently ambiguous because of the inarticu- 
lateness or laziness of the respondent. 

One further check was made on the accuracy of plans classifica- 
tion of the questionnaire. The questionnaire was administered to 
small groups of men (totaling somewhat over two hundred men) 
who were interviewed individually after they had filled it out. The 
interviewer sought to arrive at a classification of the soldiers’ plans 
by major area and by degree of certainty. Questions were raised 
with the soldier about apparent inconsistencies in his questionnaire 
responses and about some of the difficulties which might be encoun- 
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tered in trying to carry out his plans. In a number of instances the 
interviewer made his classification before having looked at the 
questionnaire and then compared it with the classification yielded 
by responses to the questionnaire. It was found that in the great 
majority of cases the use of the three-question battery, leading off 
with the write-in question, yielded identical classification with that 
made by the interviewer, when considering definite or tentative 
plans for business ownership. The major discrepancy which oc- 
curred was in the realin of vague plans, which showed up as a mark- 
edly positive response to the direct question in the questionnaire 
but were hardly apparent in the interview. In the interview it was 
easier to keep the soldier in contact with reality, and the focus of 
attention was on the plans that had been thought through. 

Although the classification of plans yielded by the questionnaire 
was similar to that obtained by interviewing, the latter technique 
permitted a far more adequate evaluation of the realism of the plans. 
It was possible to check on relevant experience, on a knowledge of 
the financial problems involved in acquiring a business, and on the 
resources available to the soldier. The questionnaire used in the 
major survey in the summer of 1944 was deficient in getting at such 
information. A set of questions which would cover the major prob- 
ing efforts of an interview was therefore added in order to obtain 
the data which seemed crucial for evaluating soldiers’ plans for 
business ownership. 


Soldiers’ Plans for Business Ownership as of Summer 1944 


On the basis of consistent responses to the three questions dis- 
cussed above, 7 per cent of the large cross section of enlisted men 
surveyed in the summer of 1944 were classified as definitely plan- 
ning to start a business, and another 4 per cent were classified as 
having tentative plans. At least a third of all enlisted men indi- 
cated some aspirations to become business proprietors. Yet even 
among the 7 per cent who consistently stated that they planned to 
start a business right after leaving the Army, there were many 
whose plans were not formulated in detail or were patently not based 
on a realistic appraisal of their own preparedness for such under- 
takings. 

Although the report which presented the findings with respect to 
soldiers’ plans for ownership of nonagricultural businesses explicitly 
stated that predictions could not be made directly from the data 
reported, and pointed out the prevalence of unrealistic planning, 
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several consumers of the research requoted the data as predictions 
of the number of veterans who would be likely to start businesses. 
As an aid to more careful interpretation of the data for planning 
purposes, an analysis of the problem of predicting the number of 
veterans likely to start businesses was prepared."*® It focused atten- 
tion on three aspects of the problem: (1) the plans of soldiers and 
the characteristics of those desiring to go into business after de- 
mobilization, (2) the extent of opportunities in the field of small 
business, and (3) the effects of changing characteristics of the popu- 
lation to be discharged, and changing conditions to be encountered 
at various phases of the demobilization process. 

It was pointed out that only about 5 of the 7 per cent who had 
definite plans for business ownership had any relevant experience or 
training in the field which they planned to enter, and only 2 of the 
4 per cent with tentative plans had such experience. Further, 
when the characteristics of men who reported plans to enter busi- 
ness for the first time were compared with those of men who had 
been self-employed in nonagricultural businesses, the former group 
was found to be younger and less well educated and to possess con- 
siderably less capital than the group of men who had been proprie- 
tors. On the other hand, the men planning initial business ventures 
were somewhat older and better educated, and possessed somewhat 
more savings than did enlisted men in the Army as a whole. 

An analysis of recent trends in the number of small businesses 
revealed a net decline of about 540,000 in the early years of the war. 
Experts in the field of business analysis were generally optimistic 
that this deficit would be wiped out soon after the end of the war 
and that perhaps an additional half million small businesses might 
come into being during the five or ten years following the termina- 
tion of hostilities.” But veterans could not count upon monopo- 
lizing the field. A quotation from the report will indicate the type 
of analysis undertaken : 


Competing with the veteran for business ownership will be two major groups: 
(1) thoee former proprietors of businesses who closed shop because war shortages 
made them unable to obtain needed stock and equipment for their continued 
operation or because they saw greater temporary attraction in war industry and 
(2) those civilians who in normal times would have turned to self-employment dur- 
ing the war period. 


4 Information and Education Division Report No. B-130A, The Problem of Pre- 
dicting the Number of Veterans Who Will Have Businesses of Their Own after They 
Leave the Army, May 1945. 

See Howard R. Bowen, “‘Trends in the Business Population,” loc. cit. 
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Relatively few urban males become employers or own-account workers before 
reaching the age of thirty or thirty-five. In 1940, for example, four fifths of the 
urban males who were self-employed were thirty-five years old or over. Among 
employed men under thirty, less than 5 per cent were self-employed, while among 
those forty-five and over, more than 20 per cent were self-employed. [1940 Census 
of Population: The Labor Force (Sample Statistics): Employment and Personal 
Characteristics, Table 12.] If one applies the age rates of self-employment indi- 
cated by the 1940 Census to the Army age distribution (in each case for urban 
males), one can estimate the expected increase in the total proportion self-em- 
ployed associated with a given increase in average age of the population. This, of 
course, assumes conditions comparable to those which prevailed just before 1940. 
With five years aging of the Army population as constituted in summer 1944, one 
would expect the addition of somewhat less than 5 per cent to the total proportion 
self-employed—an addition of about 300,000 men. 

In other words, had there been no war, and had economic conditions prevailed 
similar to those in the period preceding 1940, one would have expected that about 
300,000 of the urban males now in the Army would have become self-employed 
during a five-year period. A much larger group of somewhat older urban males 
(men who were not drafted) would have become self-employed during the same 
period. And, as earlier pointed out, several hundred thousand men who had oper- 
ated businesses of their own before the war and gave up such businesses to enter 
war industry must be counted on to return to self-employment. 

If [previously existent] conditions were to prevail, then, one would expect that 
considerably less than half of the new business proprietors in the next five years 
would come from the population now in the Army. With the former proprietors 
now in war industry considered, one might expect veterans to number not more 
than a fourth to a third of the men who become self-employed in the next five years, 
if it were not for added motivation and financial help under the GI Bill of Rights 
which may induce relatively more veterans to attempt to have their own busi- 
nesses. 

The consensus among those who have explored the probable effects of the loan 
provisions of the GI Bill is that most men who can qualify for business loans under 
the provisions of the bill could secure such loans without this legislation. The 
guarantee of minimum earnings of $100 a month may be a somewhat greater aid 
to the prospective entrepreneur. However, relatively few enlisted men have any 
considerable fund of savings for equity capital; a much larger group of civilians 
will have available capital adequate both for equity and for operating purposes, 
and in general the older civilians will have a greater amount of relevant experience. 
It would appear, therefore, that the provisions of the GI Bill will enable some 
veterans to start businesses which could not have been established without such 
aid, but that few veterans will secure from the bill any lasting competitive ad- 
vantage over nonveterans. . . . 

If the above reasoning is correct, one would predict that, in the immediate post- 
demobilization period, the 7 per cent of the Army’s male personnel who have 
both plans for business ownership and a fair share of the prerequisites for success 
might represent a maximum estimate of the number who can hope to succeed in 
business ventures. If many more men try to establish new businesses, a high pro- 
portion of failures must be expected.” 


Even the final statements in the above quotation must be consid- 
ered overoptimistic, however, for if applied to the total veteran 


2% B-130A, pp. 7-8. 
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population which would emerge from the war—roughly fifteen mil- 
lion—the estimated maximum of 7 per cent would yield more busi- 
nesses owned by veterans than the maximum expected increase in 
new businesses over a ten-year period. And even though some of 
these veterans would be returning to businesses they already owned, 
or would be buying or taking over other already existing businesses, 
in the short run the figure seems unrealistically high. A serious 
fallacy was involved in temporarily losing sight of the population 
to which the estimate would ultimately be applied. 

The analysis of the problem of predicting postwar business own- 
ership by veterans from a knowledge of soldiers’ plans concluded by 
noting that there would probably be a changing relationship be- 
tween plans and performance at different phases of the demobiliza- 
tion process. Both the personal characteristics of soldiers being 
discharged and the economic situation encountered by the separatees 
might be expected to undergo progressive modification. Stress 
was laid upon the unfavorable conditions for starting a business as 
long as shortages of needed materials and supplies prevailed and 
jobs requiring considerably less expenditure of effort were available 
at attractive wages. 


Business Plans of Separatees 


A cross-sectional survey of troops remaining in the United States 
in the summer of 1945 revealed no significant change (from the 1944 
survey) in the proportion planning to own businesses. Among 
enlisted separatees discharged from the Army in July of that year, 
however, less than 3 per cent of the men were definitely planning to 
own businesses while somewhat under 2 per cent expressed tentative 
plans to do so. 

Early separatees were older, on the average, than men remaining 
in service, and in this respect they might have seemed more likely 
prospects to attempt business ownership. Many of the older men, 
however, enlisted prior to Selective Service; they were men of meager 
education and few job skills.) They had neither the experience 
nor the training required for business operation, and relatively few 
of them planned business enterprises. Moreover, the war with 
Japan was continuing, and there was sufficient awareness of mate- 
rial shortages and the difficulties of starting new enterprises to lead 
some early separatees to defer their plans for proprietorship. In 
addition to the 4 per cent who expressed definite or tentative plans 
to go into business immediately after discharge, another 15 per cent 
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said they expected to go into business within five years after leaving 
the Army. 

By December of 1945 there was an even greater general aware- 
ness of the problems to be faced by the prospective veteran 
entrepreneur, as a result of articles which appeared in service publi- 
cations and in many of the popular commercial magazines. En- 
listed men discharged from the Army in December expressed plans 
to go into business in slightly higher proportion than had the July 
separatees, but in considerably lower proportion than had soldiers 
questioned in the earlier cross-sectional surveys. There had, ap- 
parently, been a real shift in plans during the early months of 
demobilization. 


Postseparation Performance 


The follow-up of July separatees two to four months after dis- 
charge revealed that 2 per cent were self-classified as working in 
businesses of their own while another 2 per cent had made some 
arrangements to enter business and expected to get into operation 
within six months.?!_ An additional 3 per cent reported some ar- 
rangements made but no definite prospects of entering business, 
and fully a third of the July separatees checked “I have been think- 
ing of getting my own business, but have not made any arrange- 
ments yet.”’ (See Table 13.) 

These findings relating to the performance of separatees in the 
first few months after they left the Army were reasonably close to 
predictions made from the plans of the men. There was, however, 
considerable shifting of plans for business operation in these first 
few months after the men returned to civilian status. Only about 
two fifths of the men who had, while in the Army, expressed definite 
or tentative plans for a business actually made arrangements to get 
under way. Another fourth of the men with definite or tentative 
plans for a business abandoned their plans soon after discharge, 
while the remainder were still “thinking about owning a business” 
without having taken any action. These findings indicate a lower 
level of performance than had been predicted for those who ex- 
pressed plans to own businesses. 

Compensating for the men who had abandoned plans for business 


"To classify separatees with respect to their efforts to get into business, questions 
were asked on (1) actual employment status at time of follow-up, (2) whether or not 
any arrangements had been made to buy or start a business, and (3) for those still 
planning businesses, the prospective date of getting into operation. For the detailed 
questions, see Appendix B, Questions 1, 10, 11, and 12. 
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ownership since they had returned to civilian life and for those who 
had done nothing to carry out their previously expressed intentions, 
were two groups who crystallized plans for business ownership after 
discharge. Men classified as having no immediate plans but plan- 
ning business operations within five years contributed about a fifth 
of those who actually commenced operations or made arrangements 
to do so in the near future. And, men who left the Army with only 
vague plans and those who expressed no plans at all for business 
operations contributed over two fifths of all the men who were in 


TABLE 13 


ARRANGEMENTS FOR ENTRANCE INTO BusINESs Map® wiTHIN Two To Four 
Monrss aFrer DiscHarcs BY JuLy 1945 SpPARATEES, BY PLANS 
ED AT SEPARATION 


(Per cent) 


PLANS FOR BUSINESS OWNERSHIP AT SEPARATION 
ARRANGEMENTS AFTER 


DISCHARGE Total Definite or Business Vague plans 
tentative later or no plans 
Total 100 100 100 100 
In business 2 23 2 1 
Had made some arrange- 
ments 5 17 8 3 
Planned to be in opera- 
tion within 6 months (2) (8) (3) (1) 
Planned to enter busi- 
ness somewhat later (3) (9) (5) (2) 
Thinking of owning business 
but had made no arrange- 34 31 60 29 
ments 
No apparent interest in busi- 
ness 59 29 30 67 
Number of cases 1,660 65 241 1,844 


business or verging upon entry. The contribution of men who had 
not expressed definite or tentative plans for a business while in 
service was higher than had been predicted. The relationship of 
plans to performance for individuals is given in Table 13. 

The criteria used for classification of “efforts to start operation of 
a business” did not include any specification of the characteristics 
of a “business.” That is, the respondent’s own self-classification, 
based on checking that he was ‘“‘working in [his] own business’ or 
that he had ‘“‘made certain arrangements for getting a business of 
{his] own” was accepted regardless of the type of business enter- 
prise described, just so long as the description seemed to reflect self- 
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employment. This meant, however, that some men who were 
simply hiring out their services on a day-to-day basis were included 
as having businesses of their own. Because of the difficulty of sep- 
arating the own-account worker from the proprietor, and because 
the related benefits of the Servicemen’s Readjustment Act specified 
“‘self-employment”’ rather than ownership of a “business enter- 
prise,” no attempt was made to eliminate the peddler and the odd- 
jobs man who peddled his services directly to the consumer whether 
to paint houses, trim hedges, or perform other similar jobs. 

This fact, however, explains a significant portion of the ‘‘busi- 
nesses” established by men who had not at the time of separation 
expressed plans for businesses of their own. As will be shown later, 
in discussing the characteristics of these men, the proportion of 
own-account workers was apparently much higher among the ‘‘non- 
planners” who reported themselves in business than among the 
group who carried out previously expressed plans for ownership of 
a business. 

It will be recalled that the questionnaire used in the surveys of 
separatees contained toward its close a question asking for a final 
reconsideration of plans and a choice among alternatives of employ- 
ment status. Use of this question, along with the succeeding item 
which indicated intensity of plans, yielded about the same estimate 
of the proportion of men planning a business as was obtained from 
the use of the three-question battery which included the trouble- 
some open-ended items.*? Moreover, this simple means of clas- 
sification turned out to be as closely related to performance as was 
the more cumbersome method which required a major coding oper- 
ation. Therefore, in ascertaining plans of December separatees 
for a business, only the simpler method was used. 

Because other data were more urgently required in the follow-up 
of December separatees, the specific questions asked of July sepa- 
ratees about their efforts to get into business could not be included 
in the December follow-up questionnaire. The question on em- 
ployment status at the time of the follow-up was, however, available 
in the same form as had been used for July separatees, and hence 


Those classified as having definite plans for a business indicated they were “‘very 
sure’ they would go into business for themselves; those classified as having tentative 
plans said they were ‘pretty sure’; all others were eliminated from consideration. 

% The follow-up of July separatees was designed primarily to test the adequacy of 
plans as a basis for prediction. The follow-up of December separatees, on the other 
hand, was designed primarily to obtain data useful to the Veterans Administration for 


planning purposes. 
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an index of efforts to get into business was afforded by the number 
who had either actually gotten into operation or were engaged in 
attempting to do so at the time of the follow-up. 

Table 14 presents a comparison of the extent to which July and 
December separatees managed to get into business, using responses 
to the question on current status as the sole criterion. It will be 
noted that this criterion yields a slightly higher estimate of actual 
business ownership by July separatees than was given by the more 
refined criterion used in Table 13. The explanation lies in the fact 


TABLE 14 


Business OWN&ZESHIP WITHIN Two TO Four Monrss arrsr DiscHARGE 
AMONG JULY AND DecemMBer SepaRaTEEs, BY PLANS EXPRESSED 
at SEPARATION* 


(Per cent) 


PLANS FOR BUSINESS OWNERSHIP AT SEPARATION 


STATUS AFTER DISCHARGE Definite or 
Total tentative Other 
July Separatees 

Total 100.0 100 100.0 
In business 2.8 24 2.0 
Making arrangements* 0.7 6 0.5 
Other 96.5 70 97.5 

Number of cases 1,650 68 1,688 
December Separatees 

Total 100.0 100 100.0 
In business 4.1 39 2.4 
Making arrangements 1.2 4 1.0 
Other 94.7 57 96.6 


Number of cases 2,088 96 2,003 


* Criteria different from those used for Table 13. See text for description. 


that some men who were on the verge of getting into operation (in- 
cluded in Table 13 in the category, “Had made some arrangements: 
planned to be in operation within six months’’) reported their cur- 
rent status as though they had already commenced operations. 

It will be noted, despite the similar proportions planning to own 
a business (4.1 per cent of July separatees and 4.5 per cent of De- 
cember separatees, as one may easily calculate from Table 14), De- 
cember separatees had greater success in getting into operation than 
their fellow veterans discharged five months earlier had had. 
Moreover, in two respects the accuracy of prediction from plans 
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was greater for December than for July separatees: (1) a higher 
proportion of those with definite or tentative plans carried out those 
plans and (2) a larger proportion of those who entered business was 
comprised of men who had expressed plans to do so. The reason 
seems to lie in the characteristics of the members of the two groups 
of separatees. 


Population Characteristics and Business Ownership 


In attempting to evaluate soldiers’ plans as a basis for predicting 
business ownership, particular consideration was given to such char- 
acteristics as age, education, relevant experience, and available 
capital. A comparison of the characteristics of those who achieved 
business ownership with the characteristics of unsuccessful planners 
tends to support the importance of all of these characteristics. The 


TABLE 15 


CHARACTERISTICS OF MEN WHO PLANNED TO OwN BUSINESSES AND OF OTHERS 
Wao Dip Acquire Businesses 


(December Separatees) 
(Number) 
PLANNED OWN BUSINESSES 
CHARACTERISTICS : Others who 
Successful Unsuccessful acquired 
in plans in plans businesses 
Age (total specified) 40 52 68 
Under 30 23 36 5A 
30 and over 17 16 14 
Education (total specified) 41 54 7 
Grade school 4 17 19 
Some high school 14 17 25 
High school graduate 
or college 23 20 26 
Relevant experience 
(total specified) 87 48 * 
Had own business 19 12 
Other relevant preservice 
experience 18 28 
No relevant preservice 
experience _ 8 
Available savings at sepa- 
ration (total specified) 38s 58 63 
Under $1,000 11 28 36 
$1,000 up to $2,000 8 15 18 
$2,000 and over 14 10 9 


* Data on experience were secured only from men who planned to own businesses. 
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number of cases involved is small, but the data is presented 
because of their consistency and the magnitude of some of the dif- 
ferences (Table 15). 

It will be noted that among men who had planned business own- 
ership, those over thirty, high school graduates, those with previous 
experience acquired in their own businesses, and those with savings 
of two thousand dollars or more were, in the small samples available, 
far more successful than those younger, less well educated, less ex- 
perienced, and less well supplied with savings. 

An examination of the characteristics of men who had not defi- 
nitely or tentatively planned businesses of their own, yet on the 
follow-up reported themselves as owning a business, reveals that 
they were in many respects closer to the unsuccessful planners than 
to the men who achieved success in their plans for business owner- 
ship. This fact may be regarded as further evidence that the ‘‘odd- 
jobs” sort of self-employment rather than proprietorship of a busi- 
ness establishinent bulks rather large in the group. Since the sepa- 
ratees were not asked to describe their businesses in the December 
follow-up questionnaire, there is no way of eliminating the odd-jobs 
men masquerading as entrepreneurs. But the fact that savings of 
less than one thousand dollars were reported by more than half of 
the men who later claimed to be working in their own businesses, 
even though they had not previously expressed plans to enter busi- 
ness, would suggest that many of the business operations were on 
an exceedingly small scale. It is probable, of course, that a few of 
the men classified as successful in their business plans would fit into 
the same category, but the comparative characteristics suggest that 
this happened less frequently among the men who had planned busi- 
nesses than among the others who reported that they had acquired 
their own businesses. 


Adequacy of Plans Classification 


It will be recalled that two methods of classification of plans for 
business ownership were used. The first utilized a battery of three 
questions—the lead-off write-in question, choice among alternatives 
of employment status, and a direct question on plans for business 
ownership—and was based upon consistent responses and degree of 
definiteness in the answers to all three questions. The second 
method was based upon use of a choice among alternatives of status 
coming at the close of the questionnaire, and an indication of the 
degree of certainty with which plans for business ownership were 
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held. These methods gave similar estimates of the number of men 
leaning more strongly toward business ownership than toward any 
other area of plans. Nevertheless, using a fairly rigorous criterion 
of efforts to go into business, less than half the men who had leaned 
most strongly toward business ownership carried out or sustained 
their plans in the first few months after discharge. Moreover, this 
group of successful planners comprised less than two fifths of the 
men who reported themselves to be in business or in process of mak- 
ing arrangements to start a business. Even if they comprised a 
somewhat larger proportion of the men who were proprietors of sig- 
nificant business enterprises, the relationship between the plans 


TABLE 16 


RELATIONSHIP BETWEEN ANSWERS TO DiREcT QUESTION ON PLANS FOR A 
Businuss AND PosTsEPARATION Business OWNERSHIP 


(December Separatees) 
(Per cent) 
Question: “Do you plan to buy or POSTSEPARATION PERFORMANCE 
start a business of your 
own soon after you 
leave the Army?” In Making 
Total business arrangements Other 
Total 100.0 4 1.2 94.7 
Already own a business 2.8 1.3 0.1 14 
Almost sure will buy or start a 
business 9.1 11 0.3 7.7 
May (have own business) but not 
sure 22.6 0.7 0.4 21.5 
Don’t plan business 65.5 1.0 0.4 64.1 


Total number of cases 2,098 


classification used and postseparation performance is considerably 
less impressive than in the case of plans for education or for return- 
ing to work for a preinduction employer. To what extent may this 
be attributed to inadequate data on and classification of soldiers’ 
plans? 

Table 16 shows the relationship between answers to the direct 
question on plans for a business and later performance. It will be 
apparent that predictions based upon this question would have 
greatly overstated the number of men likely to go into business in 
the first few months after discharge. Although more than three 
fourths of the men who reported themselves as working in their own 
businesses at the time of the follow-up had given some indication 
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of plans for a business in answer to the direct question, more than 
four fifths of those who had said they were “‘almost sure” they would 
buy or start a business failed to carry out their plans soon after 
discharge. 

The method of classification of plans which had been used in pre- 
paring estimates of the extent of business ownership to be attempted 
by veterans was much closer to the mark of their early postsepara- 
tion efforts than an estimate based solely on the direct question 
would have been. And, as has been mentioned earlier, the use of 
the three-question battery gave classifications which agreed in the 
great majority of instances with the classification derived from an 
interview with the respondent after he had filled out the question- 
naire. 


Plans and the Situation Encountered after Discharge 


The basic problem involved does not seem to be the classification 
of plans but rather the nature of the plans or attitudes held by many 
soldiers. There was a great deal of interest in and yearning for be- 
comimg the proprietor of a business, and some men had fixed their 
eyes on this goal without any real knowledge of the problems to be 
encountered before the goal could be attained. Other men, equally 
desirous of becoming proprietors but better able to see the difficul- 
ties ahead, expressed interest in business ownership but definite or 
tentative plans to do something else first. If certain contingencies 
occurred, they would proceed in one direction ; if other contingencies 
materialized, they would proceed in a different direction. Some 
men counted on a loan under the terms of the Servicemen’s Read- 
justment Act; the denial of the loan might mean yielding their plans. 
Others who had not counted upon being able to finance a business 
apparently found that favorable circumstances existed—perhaps a 
chance at a partnership with another veteran—and got started 
sooner than they anticipated. 

The follow-up questionnaire for July separatees asked the men: 
“If you were planning to have your own business, have you run into 
any difficulties you had not expected while you were in the Army?” 
Of the men who had planned business ownership and had succeeded 
in carrying out their plans, half said they had encountered unex- 
pected difficulties. Of those men who had not expressed plans for 
business but who reported that they nevertheless had started busi- 
nesses, only two in eighteen said they encountered unexpected diffi- 
culties. This suggests that in considerable measure these men who 
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had not planned businesses found themselves face to face with 
opportunities which crystallized their vague plans into plans of 
action. 

To the veteran who wished to go to school, by and large, the op- 
portunities were available. To the veteran who wished to return 
to his old job, by and large, the job was offered back to him. But 
to the veteran who wished to own a business, there were more diffi- 
cult conditions to be met, arrangements to be made, material short- 
ages to be overcome. Certainly, a period of three or four months 
was entirely inadequate to permit a fair evaluation of efforts toward 
business ownership. Yet even this brief period was sufficient to 
demonstrate that data on plans alone would not yield an accurate 
estimate of the number of men who would try to start businesses 
of their own. 

It is worth looking into the possibility that a more accurate esti- 
mate might have been derived from some other method of estima- 
tion. Making certain assumptions, one might have projected past 
trends in self-employment by age groups to the veteran population 
in the postwar period. It will be recalled that the application of 
rates of nonagricultural self-employment by age, as derived from the 
1940 Census of Population, suggested that five years of aging of the 
Army population as constituted in 1944 would, under prewar condi- 
tions, have added about 300,000 to the number self-employed. 
This was roughly 4 per cent of that Army population. Adding 
this group to the 3 per cent who had been self-employed at the start 
of the five-year period, one might have predicted that it would be 
1949 before as many as 7 per cent of the Army’s male population 
would be self-employed, had there been no war and no change in 
the general economic scene. Going on from this point, one would 
have had to weigh the influences of the war, likely to prevail in the 
postwar period, upon the self-employment tendencies of veterans. 
As has been indicated, this is essentially what was done. 

The data on soldiers’ plans were confronted with the best avail- 
able evidence on probable business opportunities and on the com- 
petitive ability of veterans to capitalize on those opportunities. 
Although no exact figure was arrived at, this analysis did suggest 
that the veteran population’s share in self-employment would be 
less than soldiers’ plans indicated. Nevertheless, the data on plans 
were used to give a maximum estimate. In the absence of a mini- 
mum estimate, this probably resulted in more unrealistic planning, 
for the immediate demobilization period, in some quarters than 
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would have been carried out if no data on plans had been available.™ 
It is still too early to say what the long-term effects of military 
service in World War II will be on the self-employment of the vet- 
eran population. 


SECTION IV 


PLANS FOR GOVERNMENTAL EMPLOYMENT 


Nearly 10 per cent of the men surveyed in the large cross-sectional 
survey conducted during the summer of 1944 reported that they 
had worked in governmental jobs before entering the service. They 
constituted roughly one eighth of all enlisted men and officers who 
had been employed prior to entering the service. A considerable 
proportion of these men, however, were engaged in activities which 
were not normally governmental functions—they were employed 
in war plants run by the Federal government. Almost certainly 
some of the men who classified themselves as having been govern- 
mental employees were actually employees in private war plants 
holding governmental contracts. Thus, many of the men who 
reported themselves as having been governmental employees had 
held jobs which were unlikely to exist after the termination of 
hostilities. 

Certainly the greatest attraction of a governmental job was the 
security that such a job might offer. Many of the men in service 
were old enough to have experienced the insecurity of the thirties. 
Since veterans received preference ratings for jobs with the Federal 
government and with a number of state governments, it seemed 
likely that many would make an effort to get jobs with one or an- 
other branch of government. 

The task of estimating the number of men planning to work for 
governmental bodies was seriously complicated by an unexpected 
finding: many men had misconceptions as to what constituted a 
government job. Such misconceptions introduced errors both in 
the reporting of jobs held prior to entering the service and in the 
reporting of postwar plans. In the former instance, however, it 
was frequently possible to classify a man correctly on the basis of 
his job description, ignoring in such cases the type of employer 


* On the other hand, prior to the studies conducted by the Research Branch, fan- 
tastically high estimates of veteran self-employment were reported by some prog- 
nosticators who had talked with a few men in uniform and had been impressed with the 
high level of aspirations for business ownership. 
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for whom he reported he had worked. Correction or allowance for 
the men’s misconceptions were much more difficult when dealing 
with postwar plans, however. For in those instances where a man 
described a job which would rarely be found in governmental serv- 
ice, there was no way of knowing whether the particular job or the 
status of government employment was his chief aim. Another 
source of misconception made it difficult to determine whether men 
were planning Federal, state, or local employment: many men 
simply did not know what jobs were offered by what governmental 
bodies. Some, for example, said they wanted jobs in city govern- 
ments and then specifically mentioned post-office employment. 
Others were under the impression that teaching jobs were offered 
by the Federal government. 

Aside from the extent to which these misconceptions complicated 
the classification of the men, they were in themselves important 
clues to general attitudes toward governmental employment. To 
many soldiers, ‘government’ apparently connoted Washington, 
politics, and bureaucracy. These men did not think of the varied 
jobs offered by the Federal government (as well as by state and 
local governments) in all parts of the country. By contrast, for 
other soldiers, governmental employment connoted the security 
which seems inherent in large-scale operations, whether in the 
postal service, ship-building, or the field of transportation and com- 
munication. It appears also that governmental regulation of cer- 
tain phases of transportation and communication resulted in a 
fairly widespread belief that these facilities are governmentally 
operated. Many soldiers were simply not in a position to under- 
stand the functioning of their government, possessing as they did 
such meager information about it. And certainly a great number 
of them did not have adequate information about government jobs 
to enable them to evaluate the opportunities such jobs might offer. 


Ascertaining Plans for Government Jobs 


In addition to the troublesome problem of misconception about 
government jobs, the ascertaining of soldiers’ plans for such jobs 
was complicated by the fact that here again aspirations greatly ex- 
ceeded informed plans. Thus, in the major survey conducted in 
the summer of 1944, two fifths of all enlisted men said that they 
“would like” to have a government job. Included in this figure 
were 4 per cent who reported that they had a government job wait- 
ing for them, 12 per cent who said that they would definitely try to 
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get a government job, and 24 per cent who said, “I would like a 
government job, but I am not sure I will try to get one.’”’ Even the 
16 per cent who indicated that they would have such a job waiting 
for them or would definitely try to get such a job was considerably 
greater than the number who indicated in answer to an earlier ques- 
tion about the type of employer they expected to work for, “I ex- 
pect to work for the government’’—9 per cent. 

Because interest in the security presumably attaching to govern- 
ment jobs caused many men to be suggestible when asked if they 
planned to try to get such jobs, classification of plans in this field 
was based primarily on the pattern of responses given to three ques- 
tions. Those men who indicated in answer to the general write-in 
question that they had definite plans for a job, who indicated when 
asked to choose among the alternatives of employment status or 
class of employer that they planned to work for the government, and 
who answered the direct question as to their interest in government 
jobs by saying that they either had such a job waiting for them or 
would definitely try to get a government job, were classified as hav- 
ing definite plans for a government job, provided that there was no 
inconsistency between such plans and their reported interest in re- 
turning to work for their preservice employers. The evoking of 
these criteria left only 4 per cent of white enlisted men classified as 
having definite plans for government jobs, while another 5 per cent 
were classified as having tentative plans for government jobs, but 
plans into which some note of uncertainty or inconsistency had 
crept. 

Subsequent pretesting revealed that slight changes in the wording 
of the direct question relating to interest in a government job would 
markedly modify the distribution of responses to the question. 
Copies of the questionnaire used in the survey conducted in the 
summer of 1944, interleaved with copies of a pretest version of the 
final questionnaire, were administered to roughly four hundred en- 
listed men at a single Army post in the spring of 1945. The dis- 
tribution of responses is given below along with the questions and 
the answer categories. Question A is the form used in the 1944 sur- 
vey ; question B is the form used in the final survey conducted among 
separatees. 


A. Do you think you will try to get a government (Federal, state, city, 
etc.) job after the war? (Check one) 
1 I have a government job waiting for me which 
I expect to take 10% 
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2 I am definitely going to try to get a government 
job 15 
3 I would like a government job, but I am not sure 
I will try to get one 25 
4 I would try if I couldn’t get some other job 
I would like 15 
5 T am not at all interested in a government job 31 
No answer 4 
100% 


B. Do you think you will try to get a job with some branch of the govern- 
ment (Federal, state, city, etc.) right after you leave the Army? 


(Check one) 
1 T left a government job to enter the Army, and I 
expect to return to it 7% 
2 —_____ I am definitely going to try to geta governmentjob 10 
3 _____ I will probably try to get (or return to) a govern- 
ment job, but I am not sure 
4_____I may try to get a government job 30 
5 ____ I am not interested in a government job 4) 
No answer 5 


100% 


It was felt that the term a “government job” would be more likely 
to evoke a stereotyped reaction than the phrase “a job with some 
branch of the government.’”’ Again, although many men who had 
been governmentally employed said colloquially, “‘I have a govern- 
ment job waiting for me,’”’ and therefore this wording had been used 
for the first answer category in the earlier survey, interviewing re- 
vealed that some men who checked the category meant that they 
had been promised a government job by some friend or relative. 
Thus the revision of the first answer category was in the direction 
of greater specificity and the elimination of ambiguity. Since the 
objective of these surveys was to attempt prediction of the number 
of men who would try to get government jobs, it seemed desirable 
to word the third answer category in terms of such efforts rather 
than in terms of general interest. In both instances the question 
categories afford a crude scale of interest in governmental employ- 
ment. In the case of the final form of the question it was thought 
that the cutting points yielded by the various answer categories 
would be more valuable than those yielded by the former question. 

In the survey of separatees, the classification of plans for govern- 
mental employment was based upon the direct question cited above 
and upon a question placed earlier in the questionnaire inquiring as 
to the type of employer for whom the man expected to work— 
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whether a branch of the Federal government, a branch of a state 
government, or a city, town, or county government. Among July 
separatees nearly 15 per cent said they were definitely going to seek 
a job in some branch of government or return to such a job previ- 
ously held. But only 8 per cent had also previously indicated that 
they expected to work for a branch of the government. 

Table 17 presents the results of a cross tabulation of the question 
on type of employer (Q 27) and the direct question on plans for a 
government job (Q 46). Most of the men who had checked that 


TABLE 17 


INTERRELATIONSHIP OF RESPONSES TO Two QUESTIONS 
RELATING TO INTEREST IN GOVERNMENT JOBS 


(July Separatees) 


(Per cent) 
Q 46: Do you think you will try Q 27: What type of employer do you expect 
to get a job with some branch of to work for? 
the government? 
RESPONSE CATEGORY CHECKED 
RESPONSE CATEGORY CHECKED Branch Private Undecided, 
Total of gov't employer No employer, 
No answer 
Total 100.0 12.1 49.6 $8.3 
Plan to return to gov’t job previ- 
ously held 2.8 2.2 0.2 0.4 
Definitely going to try to get 
gov't job 11.8 6.1 3.0 2.7 
Will probably try 8.0 1.4 3.2 3.4 
May try 36.5 1.8 20.1 14.6 
Not interested in gov’t job or No 
answer to question 40.9 0.6 23.1 17.2 


they expected to work for a branch of the government gave con- 
sistent indication of interest in governmental employment when 
asked the latter question. It would appear, however, that the direct 
question exerted a strong element of suggestion on a considerable 
number of men who had earlier indicated either that they planned 
to work for a private employer or that they had no idea what type 
of employer they would work for. Thus responses to the earlier 
question indicated that about 12 per cent of the men might be ex- 
pected to make a definite effort to get government jobs and that, 
in addition, some of the men who were undecided as to type of em- 
ployer (about 20 per cent) might also become interested in govern- 
ment employment. Responses to the direct question alone sug- 
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gested that nearly 23 per cent of the men might be expected to make 
some effort to get a government job and that another 36 per cent 
might be interested in governmental employment. It seems appar- 
ent that a large number of the men who were not planning to go to 
school, to return to their previous employer, or to try to start a busi- 
ness—the men who would be seeking a job with a new employer— 
might be interested in certain government jobs but simply were not 
planning definitely to work for any specific employer or type of 
employer. Many, in fact, indicated that they wanted to “shop 
around’ and then take the best opportunity that was open to them. 
One might predict that a majority of the 8 per cent who were con- 
sistent in their expression of intentions to seek or return to a govern- 
ment job might do so, but there was no way of estimating how many 
additional soldiers might decide to go after government jobs. 

Because of the change in the methods of classification of plans for 
government jobs, it was not possible to ascertain whether or not 
there was a shift in plans between the summer of 1944 and the sum- 
mer of 1945. On the basis of a comparison of responses to similar 
questions, it appears that if a change did occur it was not great. 
December separatees reported plans for governmental employment 
in roughly the same proportions as did July separatees. 

Performance of July Separatees 

The follow-up of July separatees contained two questions relating 
to efforts to get a job with some branch of the government. Because 
of space limitations and the more urgent need for other types of 
data, the follow-up of December separatees did not inquire into 
efforts to obtain a government job. 

The basic question used to classify July separatees according to 
their efforts to get a government job, and the percentages of indi- 
viduals who checked each answer category are given below: 

Have you tried to get a job with any branch of the government (Federal, 
state, city, etc.) since you got out of the Army? 
Yes, and I am already working for a branch of the gov- 


ernment 8.6% 
—__—.. Yes, I have my application in for a government job 

and will take the job if they offer it to me 7.1 
—___.. I did apply for a government job, but am not sure I 

want it now 41 

No, I haven’t tried yet, but I plan to 29.1 

No, I am not interested in a government job 45.4 

No answer 5.7 
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Roughly a fifth of the July separatees applied for government jobs 
in the first two to four months after discharge. Since many Federal 
agencies were building up staffs to handle phases of veteran aid dur- 
ing demobilization, there was a large supply of temporary Federal 
jobs available to veterans discharged early in demobilization, even 
though total Federal employment was soon to decline rather sharply. 
Notwithstanding this fact, the postal service far outstripped other 
governmental employers of July separatees. Next in importance 
were jobs connected with Army and Navy installations, including 
separation centers. City and county maintenance and protective 
jobs were the chief source of public employment other than Federal. 


Prediction for the Individual Separatee 


Among men who had consistently indicated at the time of separa- 
tion that they intended to seek a government job, three fifths had 
applied for a government job and another fifth said they still planned 
to apply when surveyed in the follow-up of July separatees. A 
fifth had apparently lost interest within the first two to four months 
after discharge. Among those who had indicated an intention to 
seek government employment in answer to one of the two classifica- 
tory questions but had not given a consistent answer to the other 
question, a fourth applied for government jobs within the first two 
to four months after discharge and over two fifths reported to the 
follow-up that they had not “tried yet, but [still] plan to” try to get 
a job with some branch of the government. Even among the large 
group who had stated at separation either that they were not inter- 
ested in a government job or merely that they might try to get a 
government job, fully a seventh of the men did apply for some form 
of public employment within the first few months after discharge. 
Table 18 gives the relationship between plans for governmental em- 
ployment and postseparation performance. 

Because of the magnitude of the segment of the population report- 
ing at separation, ‘‘I may try to get a government job” (more than 
a third of the total survey group), the men classified as having only 
vague plans or no plans for a government job actually contributed 
a little more than half of the applicants for governmental jobs soon 
after discharge. The men with tentative or inconsistent plans (16 
per cent of the survey group) contributed a fifth of the applicants 
for government jobs, while the 8 per cent who had expressed definite 
and consistent plans contributed slightly over a fourth of the appli- 
cants. 
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Thus, predictions that a given individual would or would not seek 
a government job yielded a low degree of accuracy when based upon 
the soldier’s plans. Moreover, there is no reason to believe that 
much better predictions could have been made for individuals at 
the time of Army discharge. An interviewer, ascertaining what 
specific expectations the individual separatee had, with respect both 
to job wanted and to government employment opportunities avail- 
able, might have scored a slightly higher proportion of successful 
predictions, but it is doubtful that any considerable gain in accuracy 
could have been made. It would appear that a firsthand knowledge 


TABLE 18 


PosTsEPARATION EFrrogrs TO OsTAIN A GOVERNMENT 
Jos sy PLANS EXpREssep aT SEPARATION 
(July Separatees) 

(Per cent) 


PLANS FOR GOV'T JOB AT SEPARATION 


POSTSEPARATION EFFORTS Definite Tentative Vague plans 
and or or 
consistent inconsistent no plans 

Total 100 100 100 
Employed in gov’t job 30 9 6 
Applied for gov’t job 29 16 8 
Would take job if offered (17) (9) (5) 
Application rejected (2) (1) —_— 
Not sure still want job (10) (6) (3) 
Plan to apply 20 44 27 
Not interested in gov’t job 15 26 54 
No information 6 5 5 
Number of cases 188 278 1,236 


of available opportunities—both in private industry and in govern- 
ment—which was secured from active job-seeking after discharge, 
was the crucial element in leading to a final decision between gov- 
ernment and private employment. The man who knew he wanted 
government employment could more often be counted upon to make 
an active search for such employment than could the man simply 
interested in “‘a job”; but, even for the more interested men, there 
was frequently the problem of applying and qualifying well be- 
fore the job became available. Many of these men indicated in 
their free comments at the close of the follow-up questionnaire that 
they simply could not wait for months while Civil Service registers 
were being prepared. On the other hand, there were many men 
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sufficiently interested to apply for a government job even though 
they were temporarily returning to work for their previous employ- 
ers, or for some other private employer, permitting the possibility 
of making a shift later on. 

The prediction of the proportion of all veterans who might seek 
government jobs has, nevertheless, been reasonably accurate, ac- 
cording to statistics on Federal employment. As of August 1948, 
veterans and persons entitled to veteran preference numbered 846,- 
001 out of 1,895,443 Federal employees in the continental United 
States.25 Over 560,000 of these were male veterans of World War 
II. 

It will be recalled that 4 per cent of the soldiers surveyed in 1944 
expressed definite plans and another 5 per cent expressed tentative 
plans to secure a government job; of these, 60 per cent were pri- 
marily interested in Federal employment. In the two years fol- 
lowing virtually complete demobilization of the wartime Armed 
Forces, roughly 4 per cent of all male veterans secured Federal em- 
ployment. Thus, the high level of interest in Federal jobs expressed 
by soldiers and separatees seems definitely to have carried through, 
despite the fact that jobs in private industry were plentiful. 


SECTION V 


GENERAL OBSERVATIONS AND CONCLUSIONS 


The detailed discussion of the problems of attempting to ascertain 
soldiers’ plans and of the relationship of expressed plans to actual 
postseparation performance has demonstrated the complexity of 
one type of prediction problem. The particular prediction problem 
dealt with here may not soon be encountered again. But this type 
of problem will probably become increasingly frequent. Large- 
scale social planning seems more and more inevitable, and large- 
scale planning, in a democracy, involves predicting how people will 
react to a given program. Efficient operation without coercion 
means anticipating needs before they become acute, and providing 
means of meeting them. Every time a program is proposed which 
will affect the opportunities or alternatives available to people who 
are faced with a problematic situation, planning will require an 
assessment of the probable choices which people will make in the 
changed situation. 


% According to a press release made by the Civil Service Commission on October 8, 
948. 
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What generalizations can be drawn from the present study as a 
basis for better predictions in other studies of the same type? It 
may be noted that generalizing for such purposes is itself a predic- 
tive operation. One predicts that if a given prediction problem is 
analyzed in a stated way, using specified techniques, the resulting 
predictions will be more accurate than if a different type of analysis 
had been used. The problem takes on added interest after the fail- 
ure of the public opinion polls to predict the outcome of the 1948 
Presidential election. It seems safe to assume that the polls erred 
significantly not only in sampling but also (and perhaps primarily) 
in gauging the disposition of the voter. One may ask, then, what 
techniques are known that would permit more accurate prediction? 
No attempt will be made here, however, to analyze the errors of the 
polls or to apply the generalizations here formulated to the problem 
of predicting elections. 

It should, nevertheless, be noted that when these generalizations 
are applied, they become statements of prediction, and they are 
subject to errors of prediction. They may be expected to give accu- 
rate results to the extent that the factors involved in the new prob- 
lem area are similar to those which were encountered in the predic- 
tion of soldiers’ postseparation behavior. Many more studies of 
this type will be required in order to elaborate and qualify properly 
the generalizations derived from the present study. 

The main methodological conclusions which may be drawn from 
the studies of soldiers’ postwar plans may be divided into two parts: 
(1) those relating to the technique of ascertaining plans and assessing 
intentions; and (2) those relating to the prediction problem itself. 


Soldiers’ Plans and Questionnaire Methodology 


It has already been observed that questionnaires rather than in- 
terviews were used to obtain the basic data, not because the ques- 
tionnaire method seemed intrinsically better, but because it was 
the only feasible method of securing adequate data from a large 
cross section of the Army. Without protracted intensive inter- 
viewing, a suitable questionnaire could not have been formulated. 
Interviewing revealed the extent of uncertainty and some of the ele- 
ments considered by the soldier in formulating his plans. The 
basic nature of these plans was that they were complex, shifting 
orientations to a future situation which could only be imagined. 
They were not unitary fixed attitudes which could be “‘measured” ; 
they were not orientations which could be given by an automatic 
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response to a stimulus question asking what the soldier planned to 
do. They involved reasoning through and selecting from a number 
of alternatives—and if they were to be realistic they had to involve 
an awareness on the part of the soldier of what he would want at 
some future date when the environmental pressures upon him would 
be quite different from those experienced while in service. 

It seemed imperative, therefore, that the questionnaire be de- 
signed in such a way as to encourage the respondent to reflect on 
various alternatives and to state his general expectations before 
seeking answers about detailed aspects of his plans. In addition, a 
knowledge of how he reacted to questions seeking the details of his 
plans was itself essential in order to classify those plans as to realism 
and consistency. In this sense the questionnaire was designed 
similarly to the so-called “funnel-shaped interview’ used in nondi- 
rective interviewing. In the questionnaire those items dealing 
with details of plans to enter a given field after the war were ar- 
ranged in blocks to be answered only by men who were seriously 
contemplating such entry. 

The effects of context as relating to general questionnaire organi- 
zation may be briefly summed up. Whether one led off with a bat- 
tery of questions on previous experience, on personal characteristics, 
or on postwar plans, did not apparently have any marked effect 
upon the reporting of plans, other things being equal.?® But the 
way in which the batteries of questions on postwar plans were de- 
veloped did affect the reporting of plans. An attempt to work 
from aspirations to reasoned statements of intentions was confusing 
and failed to bridge the gap between aspiration and realistic possi- 
bilities. A relatively nondirective question used as a lead-off in 
the area of plans was more successful in minimizing the reporting of 
aspirations than were other types of questions. Questions relating 
to the degree of certainty of plans as reported gave a basis for an- 
alyzing factors associated with definiteness of intentions. 

Working from the general to the specific and detailed, the ques- 
tionnaire was designed to obtain considerable detail on plans to 
enter fields of high prestige value. In this way data on plans could 


™ It may be noted, however, that in the pretest which utilized four general question- 
naire designs administered to matched groups of soldiers, answers to a series of ques- 
tions on personal adjustment were affected by placement in the questionnaire. When 
carried at the close of the questionnaire, which dealt primarily with postwar plans, the 
questions were answered much more negatively than when placed early in the form. 
A possible explanation is that the consideration of postwar plans was depressing to 
some men. 
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be evaluated as a basis for prediction in the light of the experience, 
training, and financial resources possessed by the respondent. 
Moreover, after having led the respondent to weigh these factors in 
his own thinking, these blocks of detailed questions gave him an 
opportunity to state whether he might do something else, immedi- 
ately after separation, before attaining the status aspired to. Thus, 
he was given an “out’’ which did not damage his prestige but sug- 
gested a temporary compromise. This proved to be an extremely 
effective way of screening out a number of men who had otherwise 
consistently expressed plans for business ownership and farm man- 
agement. In the first few months after separation, very few of 
these men did enter the activities they aspired to. 

In addition to considering the effects of question order and overall 
questionnaire design, it may be worthwhile to summarize the relative 
advantages and disadvantages of the different types of questions 
asked in these studies. 


Advantages and Disadvantages of the Various Question Forms 


The open-ended, nondirective question. ‘The nondirective question 
does not force the respondent to answer in a narrowly specified di- 
mension. All questions which require the respondent to choose 
among a set of suggested responses are excluded, as are all other 
questions which suggest the phraseology of response. In a sense, 
as will be discussed shortly, a question which does not indicate the 
broad field of subject matter in which the response is sought is a 
meaningless question for classification purposes (except as it may be 
valuable for diagnostic purposes in psychological study). There 
was, however, a great range in the degree of specificity of the ques- 
tions used in attempting to ascertain soldiers’ plans. 

The most general question of this sort—i.e., most completely 
nondirective—was attempted in an early pretest. The question, 
which was aimed to obtain a classification of plans by employment 
status and broad occupation was: ‘‘What do you think you will do 
right after the war?’ This question proved to be of little value 
because nearly half of the respondents answered in such terms as 
“settle down to a quiet life,” ‘‘raise a family,” etc. In an interview 
such irrelevant answers (from the point of view of classification of 
employment plans) would of course be followed up by a directive 
probe such as, “I mean your plans for work or school—what are 
they?” In the questionnaire, at least this minimum specification 
of dimensions was necessary in the original question. 
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The nondirective question has the great advantage of not sug- 
gesting an answer to the respondent and (ideally) of getting at the 
respondent’s own frame of reference. It has several disadvantages 
when used for a questionnaire to be filled out by the respondent, 
particularly if the respondent be a person with only grade school 
education and no great facility for setting down words on paper. 
The question calling for a written answer must be coded in order to 
classify the response. This is a time-consuming process and one 
which may introduce a considerable element of judgment and hence 
a considerable danger of errors of classification. Moreover, if the 
question is nondirective, some respondents will not respond along 
the desired dimension. And if dimension is made sufficiently ex- 
plicit, the element of suggestion may occur. 

Two forms of the general open-ended question were used in the 
major versions of the questionnaire. 


A. What kind of work do you think you will do right after the war? 
(Write the name of the job and describe it as fully as you can.) 


B. What kind of work do you plan to do after you get your discharge 
and take a vacation? In your own words describe what you think 
you will do for the first few years of your civilian life. Tell whether 
or not you have some definite job lined up, write the name of the job, 
and describe it as fully as you can. If you do not have a definite 
job lined up, tell what kind of work you will probably do. If you 
plan to take one job at first and change jobs later, tell what you 
plan to do first and also what you will try to changeto. Ifyou are 
undecided, tell what you have been thinking you might do. 

(Try to write fully enough about your job plans for the next few 
years so that we will understand them about as well as you your- 
self understand them at the present time.) 


It will be noted that possible plans for school attendance were not 
mentioned in either version of the write-in because it was found that 
the mention of school served to increase the inconsistency between 
expressed plans for school and those for employment. Neverthe- 
less, the exclusion of any mention of school does serve to bias the 
question slightly against the expression of educational plans, so for 
this reason the write-in was not used in the analysis of plans for 
school attendance. 

The earlier version of the write-in (A) sought to require as little 
reading as possible, in order not to discourage men with little educa- 
tion. Subsequent pretesting showed, however, that the second 
form (B) would elicit a fuller response which lent itself more readily 
to unambiguous coding as to employment status and occupation. 
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Even in the case of the fuller answers, 2 per cent were ambiguous as 
to employment status and 13 per cent were ambiguous as to broad 
occupational category (single digit code), excluding from considera- 
tion an additional 13 per cent considering alternative plans. In 
neither case was it possible to use the write-in as a basis for ascer- 
taining for a majority of respondents whether or not they were 
planning to return to a previous employer. 

The direct check-list question. The direct check-list question spe- 
cifies the precise dimension along which response is wanted and, 
except for nonresponse, permits an unequivocal classification of all 
members of the population with respect to responses along this di- 
mension. “Do you think you will actually go to full-time school 
right after you leave the Army?” gets a direct self-prediction with 
respect to full-time school. There is little danger that the respond- 
ent will answer in terms of his job plans after he completes school 
(although ambiguities may of course occur in the case of poorly 
worded questions or areas of ignorance). Further, the question is 
simple and nonresponse is held toa minimum. There is no coding 
problem, except for the man who writes a qualification of his answer, 
and these are few. 

But the direct check-list: question does have disadvantages, not 
the least of which is the large element of suggestion it contains for 
some men, particularly if such a question is the first one asked in a 
sequence of questions on plans. A series of direct questions relating 
to plans for school, for a business, and for returning to work for pre- 
induction employer showed a substantial number of men who were 
inconsistent, saying first that they would go to school and then that 
they would return to work for their former employer or do something 
else which would conflict with plans for full-time school. Such 
inconsistency could be resolved only by getting a direct indication 
of choice between the various alternatives under consideration. 

The direct question might either offer a “‘yes’” or ‘‘no” choice, 
present different aspects of a problem (e.g., contingencies), or at- 
tempt to get at the intensity of plans.2”7 The following three vari- 
ations of basically the same question were used in pretests to develop 
a battery of questions on educational plans: 

A. Regardless of what you want to do, do you think you will actually go 
back to full-time school or college after the war? 
Yes 


No 
Undecided 

= Contingency questions will be discussed in detail in a later paragraph, as they 
involve a special problem. 
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B. Regardless of what you would like to do, do you think you will actu- 
ally go back to school or college after the war? 

Yes, to full-time school or college 

—___— Yes, to part-time school or college 

No, I don’t think I will go back to school or college 

Undecided 


C. Do you expect to go back to full-time school or college after you leave 
the Army? 

Yes, I am almost sure to go to full-time school or college 

——__— I may go to full-time school or college but I’m not sure 

______ I am planning to go to part-time school 

No, I don’t think I will go back to school or college 


Question A appeared to be somewhat frustrating to the man inter- 
ested in attending part-time school or college after discharge. In 
some instances it caused such men to check ‘“‘undecided”’ rather 
than “no,” and in a few instances caused men who elsewhere indi- 
cated plans for part-time school to check ‘‘yes,” although they were 
clearly not planning to attend full-time school. 

Question B avoids the frustrating element of question A. On the 
other hand, the interpretation of the “undecided” category becomes 
dubious unless one is using other questions to indicate whether the 
indecision is between full-time school and part-time school or per- 
haps between part-time school and no school at all. As one item 
in a scale relating to educational plans this item proved satisfactory. 

Question C avoids both the frustrating element of question A and 
the interpretation problem of question B. But the introduction of 
the second category relating to plans for full-time school still leaves 
a problem of deciding whether individuals who check this category 
are more interested in attending school than in some other possible 
course of action after discharge. 

Choice among a check list of alternatives. The check-list question 
presenting alternatives was of greatest value in providing the cri- 
terion of classification for individuals considering alternative courses 
of action but leaning more heavily to one than to any other. It had 
certain additional qualities to recommend it: namely, it could be 
used to call to mind alternatives which were under consideration by 
the respondent but which may not have seemed to him to be in- 
cluded under the specification of the question, and it could be used 
to spot respondents who were basically undecided as to employment 
status, without having to do an elaborate coding job. On the other 
hand, it did not wholly avoid the suggestion factor since there ap- 
pears to have been a tendency to check prestige items rather than 
items under primary consideration in more sober expectations. 
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This was especially true in the instance of pretest forms where the 
choice of alternatives preceded a nondirective question calling for 
@ write-in answer. 

Although the choice among alternatives seems to have provided 
the most satisfactory means of establishing the general plan toward 
which the respondent was leaning most strongly at the time of the 
survey, it has been noted that this question did not necessarily pro- 
vide the best cutting point for purposes of prediction. A man lean- 
ing strongly toward an unrealistic plan but considering also a real- 
istic plan—judging realism in terms of prerequisites of background 
and experience—was in many instances more likely to shift his ori- 
entation in the direction of greater realism after discharge. Unfor- 
tunately, the number of cases involved for any specific pair of alter- 
natives was too few to establish actual probabilities in these studies. 

The position in the questionnaire of a question presenting a choice 
among alternatives was found to affect the distribution of responses, 
but the differences in predictive value of the two sets of responses 
were not consistent. It had been thought that the choice among 
alternatives when presented late in the questionnaire would entail 
more careful thought and would therefore be more highly predictive 
than when the choice was presented early in the questionnaire. 
This hypothesis received support from the higher predictive value 
of the terminal question in the case of plans for business ownership, 
but the reverse was true in the case of plans for school. The value 
of the terminal question was also attested, however, by the fact that 
it yielded as good a prediction of returns to work for a previous em- 
ployer as did the two direct questions on this subject. Perhaps most 
important, it was found that by the use of a final question involving 
a choice of alternatives, a reasonably good classification of plans to 
enter areas of high prestige value could be made at an enormous 
saving of effort as compared with the coding of write-in questions 
and the cross tabulation of a large number of questions relating to 
single areas of activity. While the data do not support a clear-cut 
generalization as to the predictive superiority of such a question 
when carried late in the questionnaire, the possible advantages seem 
to outweigh the disadvantages. On the positive side—in support 
of the notion that more accurate predictions may be anticipated 
from responses to the final question—is the fact that the respondent 
has been given a chance to review the crucial factors in the criterion 
situation and to weigh his own capabilities, limitations, competitive 
position, ete. On the negative side, there is always a possibility 
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that the confronting of the respondent by certain conditions or con- 
tingencies which he has not previously considered will tend to dis- 
tort his immediate decision. That is, he may react by overempha- 
sizing the importance of certain positive or negative factors when 
he first considers such factors, although after careful thought he 
might decide they were much less important to him. 


Questions Presenting a Statement of Specific 
Future Contingencies 


Whether or not the matter was made explicit in the questions 
asked, the plans expressed by the soldiers were to a considerable 
degree contingent plans. ‘Zf my old employer is still in business—” 
“Tf I am able to get a large enough loan—”’ ‘“‘Jf—” 

Theoretically, it would have been possible to present some of the 
most important contingencies in a series of questions and to evaluate 
plans under each for all soldiers. This was actually done in the 
first survey which attempted to ascertain plans for school, as has 
been discussed in detail, and in subsequent efforts to ascertain in- 
terest in staying in the Army after the war. It would have been 
extremely cumbersome, however, to present such a series of contin- 
gencies for every area of interest for prediction, and, moreover, it is 
doubtful whether a majority of the men would have been able to 
escape a feeling of confusion if forced to evaluate their plans under 
each separate contingency. 

But another more serious problem underlies the use of contin- 
gency questions. There is evidence that, by and large, responses 
to such questions do not provide a basis for predicting how the re- 
spondent would act, given the materialization of the contingency, 
but rather reflect merely whether or not the respondent considers 
the specified conditions as more or less favorable than those other- 
wise assumed to exist. In fact, the statement of conditions which 
seem at first glance very favorable may trick some individuals into 
making a prediction that they would utilize the favorable circum- 
stances, whereas in reality other factors make such behavior ex- 
ceedingly improbable. For example, when asked, “Do you think 
you will actually go back to full-time school or college after the 
war?” 7 per cent of the men questioned in the summer of 1943 
said “yes.” Most of these men also replied that they would go 
back whether or not the government aided them. Another 18 per 
cent said they would go to full-time school if the government pro- 
vided aid. The government did make available such aid, but the 
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proportion thereafter stating that they would return to school was 
increased by very little. Many of the additional 18 per cent who 
had thought they would go back if this favorable contingency mate- 
rialized apparently found that in reality government aid was not 
the deciding factor. One might, in fact, say that in the face of the 
stress on the desirability of increased education, some men had ra- 
tionalized a previously formulated decision not to go back to school 
in terms of not being able to afford going back without government 
aid, whereas the real reasons for not going back lay elsewhere. 


Attitude Scales and Prediction 


At any given instant in time, one might assume the theoretical 
possibility of ranking soldiers in a series of finely graded classes as 
to the degree of their readiness to follow some specific course of 
action at some future date (the assumption of the existence of an 
attitude scale). In the case of the present research, the best pos- 
sible data on soldiers’ intentions as an aid in predicting entrance 
into any particular field of endeavor after the war would then seem 
to be given by their ranking on such a scale. Other things being 
equal, the higher the ranking, the greater the probability of actu- 
ally carrying out the stated intention. 

To some extent, it was possible to rank individuals in a small 
number of classes along an assumed continuum, either by the use 
of a single question whose answer categories furnished degrees of 
intensity (e.g., question on plans to return to work for former em- 
ployer), by the use of one question involving degree of intensity and 
one involving statement of choice (e.g., final choice of alternatives, 
followed by question on degree of certainty of plans), or by the use 
of a battery of questions found by scale analysis to approximate the 
distribution of responses obtained when only a single dimension was 
involved (e.g., original questions on plans to attend school). Such 
rankings were admittedly crude, but in every instance where they 
were attempted, with one exception,* it was found that the higher 
the scale category, the greater was the proportion who carried out 
their plans. This finding may be taken as evidence that whether 
or not attitudes may be “‘measured,” rough assessments of stated 
intentions to act along some specified dimension may possess suffi- 
cient validity to permit graded predictions. 

% This was in the case of July separatees’ school plans, where the small sample of 
“tentative” planners returned in slightly higher proportion than “definite” planners. 
Almost certainly this was simply a sampling fluctuation, since the finding was reversed 
for December separatees. 
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Nevertheless, the assumption that individuals may, at least in 
theory, be ranked in an indefinite number of classes as to the degree 
of their readiness to follow a specific course of action ought to be 
scrutinized. More specifically, let us consider the attempt to rank 
soldiers as to their intentions to attend school. Let us say that the 
objective was to devise a scale containing a minimum of ten or 
twelve categories, each one representing a greater degree of inten- 
tion or determination to attend school than the category next below 
it. The first step was to find out how a large number of soldiers 
thought and felt about the possibility of attending school after the 
war. What were the attractions of attending school, what were 
the conditions that would make them more or less likely to go, what 
assessments were made of the situation likely to prevail during 
demobilization, what were the major uncertainties? Having dis- 
covered that soldiers’ thinking was concerned with the availability 
of job opportunities on the one hand or government aid for educa- 
tion on the other hand, probable age at discharge, the possibility of 
marriage, and a number of other factors, the researcher attempted 
to write questions which would reflect the influence of these contin- 
gencies on the decisions to be made. It has been pointed out that 
the answers to contingency questions cannot be taken at face value; 
but what of the possibility that such answers could provide a basis 
for ranking individuals along a single dimension—intention or de- 
termination to attend school? If a person says he would go to 
school under the most unfavorable circumstances (relative to school 
attendance) surely he should plan to go under more favorable cir- 
cumstances. In general, this was found to be true, but, as has been 
observed, individuals differed as to whether a particular circum- 
stance was considered to be favorable or unfavorable. Men who 
had started college were less deterred from planning to return by 
the thought of a long war than were those who had merely finished 
high school; men of independent means who were considering at- 
tending college as one of several alternatives were less influenced by 
the availability of government aid than were the less well-to-do 
men whose considerations involved college as an alternative. In 
so far as questions involved contingencies or values which had a 
differential appeal to different men, the questions would not permit 
the ranking of individuals along a single continuum. Where such 
rankings were approximated through question batteries using con- 
tingency questions, the approximation was made either by com- 
bining response categories which destroyed the meaningful compo- 
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nent of the contingency question—e.g., where individuals who said 
they would go to school in the face of an unfavorable situation were 
combined with those who said they would go only if the unfavorable 
situation did not materialize—or where the differential effect of 
contingencies or values was relatively slight. Thus a question 
presenting a contingency having a markedly different effect on 
ranking than other questions would have to be dropped out of use 
if one wanted to use a battery of questions as a scale. 

There are, of course, other types of questions which may be asked 
as part of a battery to yield an expression of plans with respect to 
any area. There are, as has been observed, various ways of putting 
the question of plans for school without mentioning contingencies, 
and in general these various ways yield somewhat different clas- 
sifications. It has been observed that a direct check-list question 
yielded a larger proportion reporting school plans than did a choice 
among alternatives. By and large the responses to these questions 
fulfilled the requirements of scale theory that all individuals positive 
on the more extreme statement (choice among alternatives) should 
likewise be positive on the less extreme. Through the use of such 
questions, which do not call to mind different values attaching to 
the object or different situations to be encountered, it would again 
be theoretically possible to arrive at a type of ranking which seems 
unequivocal for a majority of individuals. Moreover, the correla- 
tion of any outside variable with the scale scores derived from such 
a ranking is precisely the same as the multiple correlation of that 
outside variable with the items in the scale.” 

Realistically, however, one cannot ignore the fact that the deci- 
sions to be made will be made on the basis of weighing values and 
contingencies. Admitting that scale scores will give the best pos- 
sible predictions to be made from a group of items which form a 
scale, it is precisely the items which do not fit into a scale which 
pose the crucial problem for classification. It is only by assessing 
as best one can the probable materialization of contingencies and 
the implications of the responses to questions getting at such con- 
tingencies and values that meaningful prediction for the individual 
can be made. 

It may be worth while to present the pros and cons of the various 
types of questions in tabular form. Again, it must be emphasized 
that these are tentative generalizations; one cannot predict that 
they will hold true in other studies where a different set of variables 


_% See Chapter 1, this volume. 
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is involved. Nevertheless, they may be worth consideration when 
similar studies are undertaken. (See Table 19.) 

In so far as the predictive value of responses to a single question 
is concerned, it is not possible to state categorically that one type of 
question is always superior to another. Where plans with respect 
to a given dimension can be reasonably well evaluated for nearly all 
members of a sample through the responses to a general write-in 
question, it would appear that such responses are in general more 
highly predictive than are responses to a single direct check-list 
question, especially for areas where aspirations run high and clearly 
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TABLE 19 


CONSIDERATIONS RELATING TO THE Use or Various TYPEs OF QUESTIONS IN 
SELF-ADMINISTERED QUESTIONNAIRES FOR PREDICTING SUBSEQUENT BEHAVIOR 


Advantages 


Type of question 


Semi-nondirective 


Ideally, gets respondent’s own 


Disadvantages 


May evoke irrelevant response 


(open-ended) 


Directive, single 
dimension (check 
list) 

a. No stated con- 
tingency 


b. Contingency 
(additional 
considerations) 


Choice among check 
list of alternatives 


Scale battery of 
check-list ques- 
tions 


frame of reference 

Permits qualification and 
elaboration 

Avoids suggestion 

Avoids prestige element 


Permits unequivocal classifi- 
cation of nearly all respond- 
ents for specified dimension 

Few coding or editing prob- 
lems 


Permits rough assessment of 
importance of specified con- 
tingency to plans 


Provides criterion for un- 
equivocal classification of 
individuals considering al- 
ternatives 

Permits explicit review of 
alternatives 

Provides convenient means of 
estimating the number un- 
decided as to future status 


Permits rough ranking of indi- 
viduals with respect to given 
continuum 


dimensions 

Does not permit ranking of all 
respondents along any single 
dimension 

Gets little data from poorly edu- 
cated or inarticulate persons 

Difficult to code 


May impose arbitrary dimensions 
and obscure meaningful ones 
May overemphasize area under 
consideration 

Favors prestige items 

Does not indicate first preference 
if alternatives are also under 
consideration 


May overstress importance of 
specified contingency 


May lead to overstressing of 
prestige items 

Cutting points may not be best 
for prediction 


Ignores differential effects of con- 
tingencies 
Does not provide cutting points 
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formulated plans are relatively rare. A general write-in question 
will permit sample-wide evaluation of plans with respect to a given 
dimension only when that dimension is meaningful in the planning 
of nearly all of the individuals sampled. Thus, in the case of such 
dimensions as “‘school—nonschool,” “farm—nonfarm,’’ and ‘‘self- 
employed—working as employee’ which actually seem to have 
been dichotomized in soldiers’ thinking (and in so far as a reason- 
ably complete response to the write-in was given), it was possible 
to make a fairly adequate classification of plans from the write-in 
alone. But where the dimension was one which was of secondary 
importance in the thinking of a large segment of the survey group— 
as in the case of government jobs—or one which involved the com- 
bination of several very strong components—as in the case of plans 
to return to a previous employer, involving as it did job rights, 
occupation, personal ties, etc.—it appears that plans could best be 
assessed by the use of direct questions which clearly specified the 
dimensions along which classification of plans was desired. Re- 
sponses to such questions required supplementation as well as a 
knowledge of the more meaningful dimensions, but at least they 
permitted very rough gauging of potential interest. Where plans 
simply were not crystallized with respect to an area, no attempted 
assessment of plans could have provided an adequate basis for 
prediction. 


Observations of the Prediction Problem 


The objectives of these studies of soldiers’ plans were such that 
several different kinds of data were required. For certain types of 
program planning there was a need for the best possible estimate of 
the number of veterans who would actually enter specified fields— 
full-time school, agriculture, etc. At the same time there was a 
need for estimates of the amount of pressure which would be put 
upon existing facilities, along with a knowledge of the characteris- 
tics of men planning to enter fields which were sure to be over- 
crowded. Thus there was a need on the one hand for an assessment 
of the potential opportunities available to veterans and on the other 
hand for a study of the various other factors which might affect the 
actual activities engaged in by veterans subsequent to discharge. 

Although predictions for the individual were not required as such, 
the absence of any preexistent trends which might be extrapolated 
meant that the estimates sought had to be built up from an analysis 
of the problems of decision faced by individual veterans, and of the 
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relationship between expressed plans and other personal character- 
istics and performance in the situation to be encountered after dis- 
charge. If plans were to provide a basis for reasonably accurate 
predictions for the individual, it was noted that at least four basic 
conditions would have to be met: (1) that the large majority of 
soldiers be at least roughly classifiable as to their plans; (2) that 
the method used to assess plans yield both reliable and valid clas- 
sifications, so that the estimate would have some stability and 
would reflect a real tendency to carry out the stated course of action; 
(3) that both the individual and the situation(s) upon which his 
plans were predicated would not change so markedly as to lead to a 
major shift in plans after assessment; and (4) that the individual’s 
plans (including his definition of the situation) be sufficiently real- 
istic so that the plans could be maintained when the actual situation 
materialized. 

In so far as these conditions were not met, errors of individual 
prediction tended to arise, but such errors, particularly those result- 
ing from a failure to attain the first two conditions, did not neces- 
sarily invalidate estimates of the proportions of soldiers who would 
follow specified courses of action, since to some degree the errors 
were compensating. As had been anticipated, some men with defi- 
nite plans yielded those plans in the face of unexpected difficulties, 
while others with only tentative or vague plans decided to follow 
the course deserted by others. 


Statements of Intention as Predictive Items 


Even if a complete and accurate knowledge of the individual’s 
plans and intentions could be secured by questionnaire, the predic- 
tive value of such plans and intentions would remain in question. 
Dollard® has made a tentative analysis of the conditions under 
which opinions and statements of intention permit accurate predic- 
tion of behavior. Certain of the concepts he uses will be helpful 
here. He uses the term “origm situation” to refer to the situation 
or process in which the opinion or intention was originally formu- 
lated. In the case of most soldiers who reported definite plans when 
surveyed, it may be assumed that the origin situation preceded the 
actual questioning. A decision had been made—perhaps years 


*® John Dollard, ‘Under What Conditions Do Opinions Predict Behavior?” Public 
Opinion Quarterly, Vol. 12, No. 4 (Winter 1948-49), pp. 623-632. This paper was 
originally read before a joint meeting of the Washington Statistical Society and the 
Washington chapter of the Institute of Mathematical Statistics, March 9, 1944. 


690 STUDIES OF POSTWAR PLANS 


before, perhaps only a few weeks or even days before—as to what 
they would try to do after discharge. When the “test situation” — 
the questionnaire—was encountered, this decision was reported. 
For other soldiers, the test situation was encountered before definite 
plans had been crystallized, and the questionnaire may actually 
have been the basis for reaching a crystallization of plans. About 
a sixth of all soldiers surveyed were unable to reach even a tentative 
formulation of plans at the time of discharge, although they reported 
their various interests and proclivities with respect to a number of 
possible activities. For all of these groups of men there lay ahead 
a third important situation—the one around which prediction was 
centered. Dollard has called this the “criterion dilemma.” For 
the soldiers surveyed, the criterion dilemma involved the problem 
of settling back into civilian life as students, as employees, or as self- 
employed workers. 

Dollard has attempted to specify certain conditions favorable to 
accurate prediction from statements of opinion or intention. They 
may be divided into two types: those relating to the characteristics 
of the individual and those relating to the congruence of the three 
situations in which opinions or intentions are formulated, expressed, 
and put to the test of action. Individual characteristics favorable 
to self-prediction are: (1) freedom from conflicts of which the indi- 
vidual is unaware, (2) verbal skill for the analysis of one’s own in- 
clinations and intentions, and (3) habitual action on the basis of 
previous thought. Situational conditions favorable to prediction 
are: (1) an origin situation closely approximating the criterion 
dilemma, (2) a test situation uncomplicated by extraneous rewards 
or punishments, (3) a test situation closely approximating the cri- 
terion dilemma in presenting the relevant factors eventually to be 
considered, and (4) no new origin dilemma—i.e., no shift in inten- 
tions—subsequent to the test situation. With the exception of the 
last item (which is essentially the same as the third of four basic 
conditions laid down for successful prediction in this study), the 
above conditions all relate to the (fourth) general requirement pre- 
viously stated, that plans and intentions must be “‘realistic’’ if they 
are to permit accurate prediction. On the basis of the studies of 
soldiers’ postwar plans, it is possible to supplement certain of these 
conditions enumerated by Dollard. 

First, considerable support is given to Dollard’s common-sense 
statement that “‘a man can best predict what he will do in a future 
situation if he has been in about the same situation before.”’ Obvi- 
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ously, few soldiers had previously faced the problem of deciding 
what they would do when discharged from the service, but those 
who expressed plans to return to an activity they had already expe- 
rienced were more likely actually to do so than were those who 
planned to try a different activity. Thus men who had been in 
school and planned to return realized their plans in higher propor- 
tion than did those who before the war had been employed but 
planned to shift to school after the war. Again, plans being com- 
parable, those men who had been self-employed were more likely 
to resume that status than were men who had worked for an em- 
ployer. The same tendency for experience to be associated with 
more accurate self-prediction was noted in the case of soldiers re- 
turning to work for a previous employer, in the case of soldiers re- 
turning to farming, and in the case of those seeking government 
jobs. 

In all the above instances, one may say that the man who had had 
previous experience was able to make a more realistic judgment as 
to what he would do, not merely because he had a truer knowledge 
of the criterion situation but also because there were continuing 
pressures upon him to utilize the benefits of his previous experience. 
This is reflected in the fact that even when men did not plan to re- 
sume the activity in which they had been engaged, they tended to 
do so in larger proportion than did other men with comparable 
plans. 

Dollard observes, as separate conditions, that prediction is favored 
if “no new origin dilemma intervenes between the test and criterion 
situation” changing the individual’s intentions, and ‘“‘if the test 
question explicitly presents the conflict, i.e., anticipations of rewards 
and punishments, of the criterion dilemma.” In a sense, both of 
these conditions relate to the previous condition that the situation 
postulated in formulating plans be similar to the situation subse- 
quently encountered. The end of the war, subsequent to the state- 
ment of plans but prior to the follow-up of July separatees, posed a 
new origin situation, calling for the reformulation of job plans, at 
least by some men, because it changed job opportunities in the cri- 
terion situation. Likewise, marriage on the part of the man who 
had planned to return to school might produce, or be part of, an- 
other type of origin dilemma. The test question could approximate 
the criterion dilemma only if it brought to mind the possibilities of 
the GI Bill and the difficulties posed by marriage for the potential 
student. Yet, if these possibilities and difficulties had not been 
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recognized in the process of formulating plans, originally, then the 
test question might itself pose a new origin dilemma. 

It is at this point that one may question or at least desire to qual- 
ify Dollard’s formulation. To say that “prediction is favored pro- 
viding that no new origin dilemma intervenes between the test and 
criterion situation” is in a sense merely to say that people will 
do what they said they will do if they don’t change their minds first. 
But it carries also the implication that once the individual has 
stated his intentions, he remains in this respect static unless some 
crucial experience leads him to change his intentions. Actually, 
where true dilemmas are involved, there is usually a good deal of 
vacillation in the individual. In any given test situation, the indi- 
vidual is subject to moods and the influence of recent events. Any- 
one who has faced a major decision relating to a change in career, 
where the pros and cons very nearly balanced, knows that a decision 
made tentatively and not yet regarded as a commitment may be 
subject to considerable modification and fluctuation. Although 
the present study does not adequately document this point, it is the 
writer’s belief that a large chance factor is involved in both the test 
and the criterion situation. Much depends upon the mood and 
recent experiences of the subject at the times when he is faced with 
reporting his plans and with making the actual decision. 

In a sense then, the concept of origin situation is an artificial one 
in cases where a definite determination to act in a given way has not 
been formulated. In such cases, one attempts to assess a prevail- 
ing orientation, not to present a stimulus question which will evoke 
a ready-made response. 

To restate Dollard, prediction of choice or decision at some future 
date (or of the carrying through of choices already formulated) is 
favored if the respondent is led to consider all of the alternatives 
which will be open to him and the consequences involved in the case 
of each alternative. It is probable that almost any prediction 
which contains some degree of ego-involvement is tinged somewhat 
by the aspirations of the predictor.*! If the respondent can be 
confronted with some of the problems to be encountered in the cri- 
terion dilemma, the effects of such wishful thinking may to some 
extent be minimized. 

" Karly in the war, it was found, for example, that men most desirous of avoiding 
overseas or combat service tended to predict a considerably shorter war than did men 
who felt that they were ready for combat. See also Ogburn’s finding on the prediction 


of football scores and college grades: W. F. Ogburn, ‘Studies in Prediction and the 
Distortion of Reality,” Social Forces, Vol. 13, No. 2 (December 1934), pp. 224-229. 
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Wishful thinking must be considered not only in connection with 
the aspiration level of the respondent but also in connection with the 
implicit assumptions he makes as to the development of contingen- 
cies. It has been stressed at several points that widespread unem- 
ployment of veterans would probably have produced far greater 
errors in prediction than were actually observed. Yet widespread 
unemployment actually was anticipated by a great many veterans. 
Toward the close of the war, concern about future jobs led the list 
of personal anxieties reported. If, then, soldiers anticipated unem- 
ployment, why should the self-predictions which they made materi- 
alize most accurately in a situation of high employment and relative 
prosperity? One answer seems to be that optimism or wishful 
thinking led many soldiers who expected a postwar depression to 
discount the effects to themselves. Whereas 79 per cent thought 
most soldiers would find it ‘“‘very hard’’ or ‘fairly hard” to get the 
kinds of jobs they wanted after the war, only 46 per cent thought 
they personally would find it “very hard” or “fairly hard” to get 
the kinds of jobs they wanted. Thus, between optimism with re- 
spect to their own competitive positions and general aspirations that 
led to reaching beyond immediate realities if poor economic condi- 
tions should prevail, many soldiers who predicted poor conditions 
were actually planning for prosperity. 

The experience with contingency questions, however, casts some 
doubt as to whether prediction is always favored “if the test ques- 
tion explicitly presents the conflict, i.e., anticipation of rewards and 
punishments, of the criterion dilemma.” As already suggested, 
rewards or punishments considered in the test situation may lead to 
a different decision from what would be arrived at when those fac- 
tors were faced in the total context of the criterion situation. It is 
patently impossible to sketch out all of the relevant factors relating 
to an occupational choice, in questioning respondents as to their 
plans. In mentioning some factors, to the exclusion of others, it is 
conceivable that one will elicit responses even more divergent from 
those which would be made in the criterion situation than if the re- 
spondent were left to consider the factors regarded as relevant by 
him. 
Other things being equal, a person can better predict his choice 
with respect to an activity when it is to be primary than when it is 
to be secondary. In a more limited sense, choice of a full-time ac- 
tivity can be more accurately predicted than can choice of the same 
general course if it is to be pursued as a part-time activity. The 
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prediction of enrollment of full-time students was considerably 
more accurate than was that of part-time students. Again, predic- 
tion of the amount of postseparation vacation to be taken was con- 
siderably less accurate than was prediction of the primary activity 
pursued. Choices which involve a primary activity must, to some 
extent at least, be squared with choice of place of residence, avail- 
able opportunities, and possible alternative activities; but a second- 
ary or part-time activity can be chosen without ruling out other 
activities in which one is interested. Thus, less rigorous thinking 
is required in arriving at the decision in the instance of secondary 
activities. Moreover, it is probable that aspirations or pressures 
to “give the right answer’’ are more likely to affect the reporting of 
a future action which need not be brought into congruence or pos- 
sible conflict with other actions. 

Where prediction from intentions opposed the prediction which 
would have been made on the basis of personal characteristics and 
background factors alone, accuracy of prediction was lower than 
when intentions and characteristics were both favorable to perform- 
ance. But plans or intentions definitely held (positively or nega- 
tively) permitted more accurate predictions for most subgroups of 
the population studied than could have been made on the basis of a 
knowledge of personal characteristics and background factors alone. 

Support is given to the common-sense generalization that a person 
is best able to predict his subsequent behavior if he feels fairly cer- 
tain of his plans. Clear evidence was afforded in every area of 
prediction here studied, that men relatively sure of their plans were 
far more likely to carry out those plans than were those who re- 
ported some uncertainty. 

There were no consistent indications that better educated men 
predicted their activities more successfully than men with less for- 
mal schooling, among individuals holding definite plans. On the 
other hand, the men with relatively little formal schooling were 
more likely to be undecided in their plans, so that somewhat poorer 
predictions were made for this group as a whole, in comparison 
with high school graduates. 

Finally, the present study has clearly indicated that an analysis 
of the future situation for which one is attempting to predict be- 
havior is a crucial element in the prediction process. Knowledge 
of the relationship between intentions or attitudes and behavior in 
one situation does not insure accurate actuarial predictions even for 
seemingly similar situations. One might say that accurate predic- 
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tion for the individual is favored to the extent that he is able to 
manipulate or control the pressures upon his behavior in the cri- 
terion situation. Where opportunities to carry out plans and in- 
tentions are good, a much higher proportion of correct predictions 
will be made than where unforeseen pressures demand compromises 
and reevaluations. 

If the primary interest in these studies had been the prediction of 
individual behavior as such, one would certainly have wanted in- 
formation on the specific situations to which the men were returning 
—on the views of close friends and relatives who might influence 
the returned veteran, on the specific opportunities and disabilities 
to be encountered, etc. But in this study, and probably in most 
studies of this sort where attitude data are utilized for planning a 
program, generalized assessments, both of intentions and of situa- 
tional factors, have to suffice as a basis for qualified predictions. 


Appendixes to Chapter 16 


Appendix A. Questionnaire Used to Survey Soldiers at Separation Centers 


1. What is your Army rank or grade? (Put a check on the line in front of your answer 
like this /) 


1 Private or Pfc 
2 Corporal or T/5 
3 Sergeant, any grade 


2. How old were you on your last birthday? (Please fill in the blank below) 
. years old on my last birthday 


3. Are you: (Check one) 


1 Single 
2 Married 
3 Widowed, divorced or separated 


4. How many children do you have? (Check one) 
I’m not married 

Have no children 

Have one child 

Have two children 

Have three or more children 


5. How far did you go in school? (Check only one answer—the highest grade of school 
you completed.) 

Less than 6th grade 

Finished 6th grade 

Finished 7th grade 

Finished 8th grade 

Some high schoo! but did not finish 

Graduated from high school 

____— Some college but did not finish 

____— Graduated from college 
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6. How many years was it between the time you left full-time school and the time you 
came into the Army? (Check one) 
~——___— I came into the Army less than 3 months after I left full-time school 
3 months up to 1 year 
—__ Over 1 year up to 2 years 
—__—_ Over 2 years up to 3 years 
——__~ Over 3 years up to 5 years 
Over 5 years 


7. What is your race? (Check one) 
1 White 
2 Negro 
3 Other 


8. How long have you been in the Army? (Check one) 
1 6 months or less 


2 Over 6 months up to 1 year 
3 _____ Over 1 year up to 2 years 
4 Over 2 years up to 3 years 
5 ______ Over 3 years up to 4 years 
6 ____. Over 4 years up to 5 years 
7 —_—_ Over 5 years up to 10 years 
8 ____. Over 10 years 


9. (a) Have you served outside the Continental United States in this war? (Check 
one) 


= Yeu 
1 No 
(b) Have you been in actual combat in this war? (Check one) 
2 No 
3 ____ I have been under enemy fire, but not in actual combat 
4 I have been in actual combat 


10. What branch of service were you last in before you came to the Separation Center? 
(Check one) 

Engineers 

2 Field Artillery 

3 _____ Quartermaster Corps 

4 Infantry 

6 Medical Department 

6 ____._ Armored Force ~ 

7 

8 

9 


Air Corps 

Transportation Corps 

— —____ Ordnance Department 

_ Chemical Warfare Service 
— —.__ Military Police 


——— Tank Destroyer 
Other. Which one? 


11. In what State of the United States did you spend most of the last five years of your 
civilian life before you came into the Army? 


(Write the name of the State on the line above) 


12, 


13. 


14, 


15. 


16. 


17, 


BSBTUDIES OF POSTWAR PLANS 697 


Where were you living most of the last five years of your civilian life before you 
came into the Army? (Check one) 
1 On a farm 


2 In the country but not on a farm 

3 In a village with fewer than 2,500 people 

4 In a small city with between 2,500 and 25,000 people 
5 In a city with between 25,000 and 100,000 people 

6 In a large city with more than 100,000 people 


What living arrangements did you have before you came into the Army? (Check 
one) 


1 Lived in the home of my parents (or wife’s parents) 
2 Lived in the home of other relatives or friends 

3 Lived in a boarding or rooming house 

4 Rented an apartment 

5 _____ Rented a house 

6 ____. Owned a house 

7 Other. What? 


Which of the following were you doing just before you came into the Army? (Check 
as many as you were doing just before you came into the Army.) 


1 Working full-time for an employer, for salary, wages or commission 
2 Going to full-time school or college 

3 Farming (either with your family or for yourself) 

4 Had my own business (did not receive salary, wages or commission) 
5 Looking for a job 

6 Working part-time for an employer 

7 Going to part-time school or college 

8 _____ Something else. Whai? 


In the LAST job you had before coming into the Army, were you working for a 
company or person or branch of the government? (Check one) 


1 A company or person with fewer than fifty employees 
2 A company or person with fifty or more employees 

3 A branch of the Federal Government 

4 A branch of a state government 

5 A city, town or county government 

6 Other employer. What? 

7 I was not working for an employer 


How long did you work for your last employer before you came into the Army? 


(Check one) 
1 Less than six months 


2 Six months up to a year 

3 One year up to two years 

4 Two years up to three years 

5 Three years up to five years 

6 Five years or more 

7 I was not working for an employer 


On the average how much money were you earning in your LAST job? 
(Fill in one of the blanks below.) 
Weekly pay. ...... $F séperr eek 
or 


Monthly pay ...... $e SSCéers moorntth 
I did not receive any regular pay 
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18. (a) What kind of work were you doing just before you came into the Army? What 
was your job called, what business or industry did you work in, and what did 
you dof (For example: “Sales clerk, waited on customers in a grocery store” or 
“Weaver, operated a loom in a cotton textile mill.’’) 


(b) About how long did you do that kind of work, either on your LAST job or other 
jobs? 
About _______ years and ________ months 


THIS IS THE MOST IMPORTANT QUESTION IN THE QUESTIONNAIRE. 
READ IT THROUGH VERY CAREFULLY. 


19. What kind of work do you plan to do after you get your discharge and take a vaca- 
tion? In your own words describe what you think you will do for the first few years 
of your civilian life. Tell whether or not you have some definite job lined up, write 
the name of the job, and describe it as fully as you can. If you do not have a defi- 
nite job lined up, tell what kind of work you will probably do. If you plan to take 
one job at first and change jobs later, tell what you plan to do first and also what 
you will try to change to. If you are undecided, tell what you have been thinking 
you might do. 

Try to write fully enough about your job plans for the next few years 
80 that we will understand them about as well as you yourself under- 
stand them at the present time. 


20. How sure do you feel that after leaving the Army you will actually do the kind of 
work you described just above? (Check one) 


1 _____ I’m almost sure I will 
2 I probably will but I may do something else 
3 ______ I’m really not at all sure what I will do 


21. Before you came into the Army, did you ever do the kind of work you think you 
will most probably do after you leave the Army? (Check one) 
1 No, I never did that kind of work 

No, but I studied to do that kind of work before I came into the Army 
No, but I learned that kind of work in the Army 
—__— Yes, I did that kind of work in civilian life 

How long did you do that kind of work? 

For about _______ years and _________ months 


2 
3 
4 


22. Regardless of whether you have definite plans or not, which one of these things do 
you think you will most probably do FIRST—right after you leave the Army? 
(Check one—if you plan to do more than one, check the one you expect to spend the 
most time at) 

I will probably work for an employer, for wages, salary or commission 

I expect to work on a relative’s farm (father’s, uncle's, etc.) 

I expect to work in a relative’s business ( father’s, uncle's, etc.) 

I plan to go to full-time school or college 

I plan to run my own business or farm 


1 
2 
3 
4 
5 


23. 


24, 


265. 


26. 


27. 


28. 


29. 
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Do you plan to work for an employer the FIRST thing after you leave the Army 
and have a vacation? (Check one) 


1 Yes, I plan to work full-time for an employer 

2 I plan to work part-time for an employer. What else will you dof 

3 I don’t have definite plans, but I will probably work for an employer 
4 No, I don’t plan to work for an employer 


Do you think you could get work with the employer you worked for before you 
came into the Army, if you wanted to? (Check one) 

Yes, I’m almost sure I could 

Yes, I think so, but I’m not sure 

No, I probably couldn’t 

I’m sure I could not 

I was not working for an employer before I came into the Army 


on PONE 


Do you think you actually will go back to work for the same employer you worked 
for before you came into the Army? (Check one) 
1 Yes, I’m almost sure I will 
2 Yes, I probably will 
3 —____ I may, but I probably will not 
4 
5 


No, I’m quite sure I won't 
I was not working for an employer before I came into the Army 


If you do not plan to go back to work for your last civilian employer, do you know 
of some other job you can get and plan to take? (Check one) 


1 I do plan to go back to my last civilian employer 

2 _____. Yes, I know of a job I can get and I plan to take it 

3 ____ I know of a job that I can probably get and I may take it 

4 No, I shall have to look for a job 

5 I don’t expect to work for an employer right after I leave the Army 


What type of employer do you expect to work for in your FIRST job after you 
leave the Army—a company or person or branch of the government? (Check one) 
A company or person with fewer than fifty employees 

A company or person with fifty or more employees 

A branch of the Federal government 

A branch of a state government 

A city, town or county government 

Other employer. What? 


I have no idea what type of employer I’ll work for 
I’m not planning to work for an employer right after I leave the Army 


ON Sar ON = 


As far as you can tell now, do you plan to stay in the first job you take for at least 
five years, or do you expect to change to some other job or type of work within five 
years after you leave the Army? (Check one) 


1 I plan to keep my first job for at least five years, if I can 

2 I plan to change to something else within five years after I leave the 
Army 

3 I don’t have any plans to change, but I might want to 


If you plan to leave your first job within five years, what do you plan to do after 
you leave your first job? 
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30. Do you plan to buy or start a business of your own soon after you leave the Army? 
(Check one) 


1 I already own a business which I expect to operate after I leave the 
Army 

2 No, I don’t plan to buy or start a business 

3 _____. Yes, I am almost sure I will buy or start a business 

4 I may, but I’m not sure 


(If you already have, or plan to have your own business, answer the questions below [ques- 
tions 81-87]. If not, go on to question 38.) 


31, What kind of business do you plan to operate and what do you yourself expect to 
do in this business? —-- 


32. If you have had some experience in this kind of business, where did you get your 
experience? (Check one) 


1 In my own business 

2 In my family’s business 

3 In a business owned by someone else 

4 In the Army 

5 _____ I have not yet had any experience in such a business 


33. Do you expect to start a new business or go into a business that is already running? 
(Check one) 
1 


2 
3 
4 


T plan to start a new business 

I plan to buy a business or a share in 8 business that is already running 

I plan to take over a business that now belongs to someone else in my 
family 

I already own a business in my own name 


34. Do you expect to own the business by yourself or do you expect to go into partner- 
ship with someone else? (Check one) 


1 I expect to own the business by myself—in my own pame 


2 I expect to go into business as part-owner with my father or some other 
relative 
3 I expect to go into partnership with someone not related to me 


35. What do you estimate that the total cost (both cash and credit) of getting under 
way will be? (The total amount for buying or renting a building, tools, supplies, or 
whatever you will need, but not counting your living expenses while you are getting 
started) 

About:§.- 
I haven’t yet made any estimate of what the business would cost 


36. Have you made any actual arrangements yet for getting and operating a business, 
or is your idea still in the planning stage? (Check one) 
1 I already own a business 


2 I have been thinking about running a business but have not made any 
actual arrangements yet 
3 


I have already made certain arrangements for getting and operating a 
business. What arrangements have you madef 
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37. Do you plan to go into business for yourself as soon as you leave the Army or will 
you do something else first? (Check one) 


1 TI plan to work for someone else for a while first 


2 I plan to go to school for a while first 
3 I plan to go into business for myself as soon as I Jeave the Army 
4_____ Something else. What? 


el 


(Everyone should answer these questions) 


38. Do you feel that you have had as much school or college as you want, or would you 
like to go back to school or college after the war? (Check one) 


1 I have had as much school or college as I want 
2 I would like to go back to full-time school or college 
3 I would like to go back to part-time school or college 


39. Regardless of what you would like to do, do you think you will actually go back to 
school or college after you leave the Army? (Check one) 


1 Yes, to full-time school or college 


2 Yes, to part-time school or college 
3 No, I don’t think I will go back to school or college 
4 Undecided 


(If you do plan to go to school or college either full-time or part-time, answer the questions 
below [questions 40-45]. If not, go on to question 48.) 


40. What kind of school are you planning to attend? (Check one—if you plan to attend 
more than one kind of school, check the kind of school you will go to first.) 


1 High school, general academic 

2 High school, vocational 

3 —____ Trade school (Diesel, electrical, etc.) 
4 Business school 

5 ____. College or university 

6 Other. What kind? 


41. Just what course or type of course do you plan to take? (Give the name of the course 
and a brief description of what it covers) 


42. What school or college do you plan to go to? 


I haven’t decided yet 
43. Do you plan to enroll in school or college when the next school term begins? (Check 
one) 
1 _____ Yes, I plan to enroll when the next school term begins (or even sooner) 
2 No, I don’t plan to enroll until later 
When do you plan to start school? 
Month ----- — ——__— —._ —- =... Year. 
44. How long do you expect to go to school or college? (Check one) 
1 Less than six months 
2 Six months up to one year 


One year up to two years 
Two years up to three years 
Three years up to four years 
Four years or more 


3 
4 
5 
6 
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45. What kind of work do you plan to do after you finish school? 


(Everyone should answer these questions) ‘ 


46. Do you think you will try to get a job with some branch of the government (Federal, 
state, ctty, etc.) right after you leave the Army? (Check one) 


1 I left a government job to enter the Army, and I expect to return to it 

2 I am definitely going to try to get a government job 

3 I will probably try to get (or return to) a government job, but I am not 
sure 

4 I may try to get a government job 

5 I am not interested in a government job 


47. If you plan to get a government job, what will you do if you have to wait a while 
after you apply before you actually get the job (either until you take a Civil Service 
test or for some other reason)? (Check one) 


1 I don’t plan to apply for a government job 

2 I plan to do something else until I can get a government job. What? 

3 ____ I will just wait until I can get a government job 

4 I am almost sure I will be able to begin work in a government job right 
away 

5 I don’t want a government job if I have to wait to start work 


48. Have you ever worked on a farm for as much asa year? (Check one) 


1 Yes, full-time for a year or more 

2 Not full-time, but I grew up on a farm 

3 I worked on a farm, but less than one year 
4 No, I never worked on a farm 


49. If you worked on a farm, whose land did you farm on the last time you worked ona 
farm? (Check one) 


1 I never worked on a farm 

2 I owned the land in my own name 

3 My family owned the land and I was farming with them 

4 My family rented the land and I was farming with them 

5 I rented the land from my family, but was farming entirely for myself 
6 I rented the land from some other landlord 

7 I was working on someone else’s farm for salary or daily wages 


50. Do you think you will do farming when you get out of the Army? (Check one) 


1 Yes, I am almost sure I will farm full-time 

2 T plan to farm part-time—that is, to have a steady job off the farm at 
the same time. Whatjobf oss 

3 _____ I may do farming full-time, but I’m not sure 

4 I may do farming part-time, but I’m not sure 

5 No, I’m almost sure I won't farm 


(If you plan to farm, either full-time or part-time, answer the questions below [questions 
61-58]. If not, go on to question 59.) 


51. Do you think you will make farming your life work, or do you think you may want 
to do some other kind of work later on? (Check one) 


1 T expect to make farming my life work 


42. 


53. 


54. 


56. 


57. 
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2 I expect to work on a farm fora while, but I might later change to some 
other sort of job 

3 I expect to work on a farm only if I cannot get another sort of job I want 

4 I expect to farm part-time only 


Where do you plan to get the land to farm on right after you leave the Army? (Check 
one) 


1 I already own the land in my own name 

2 T plan to buy the land 

3 My family owns the land and I will farm with them 

4 My family rents the land and I will farm with them 

5 I will rent the land from my family but will farm entirely for myself 
6 I will rent the land from some other landlord 

7 I will work on someone's farm for salary or daily wages 

8 I'm not sure yet whose land I will farm on 


Where do you expect to get the equipment and power (tractor or horses) to farm 
with after the war? (Check one) 


1 T already own the equipment in my own name 

2 I expect to buy the equipment 

3 My family owns the equipment and I will farm with them 

4_______ My family rents the equipment and I will farm with them 

5 T will rent the equipment from my family, but will farm entirely for 
myself 

6 I will rent the equipment from someone else 

7 I will work on someone’s farm for salary or daily wages 

8 I am not sure yet whose equipment I will farm with 


If you expect to farm after the war with land and equipment rented from someone 
else, what kind of rent will you pay? (Check one) 


1 I don’t expect to rent land or equipment from anyone 
2 I will pay rent in cash 

3 I will pay rent with a share of the crops 

4 Other. What? 

5 I am not sure what kind of rent I will pay 


. If you plan to rent or buy land in your own name right after you get out of the Army 


do you know what land you will farm on? (Check one) 


1 No, I have no idea what land I’ll farm on 
2 —_____ I know about where Ill farm but don’t know just what farm 


3 I think I know the land I’! farm on 
4 I know exactly what land I’ll farm on 
5 I don’t plan to rent or buy land 


Will you farm by yourself right after you leave the Army, or will you do something 
else first? (Check one) 


1 I plan to do something other than farming for a while first 
2 I plan to farm with my family first of all 

3 I plan to work on someone else’s farm for wages first of all 
4 I plan to rent or buy a farm right after I leave the Army 


If you plan to buy land or equipment, about how much money do you expect to 
pay for it? (Total cost, not just cash payment) 
About $e 
I don’t plan to buy land or equipment 
T haven’t yet made any estimate of what it would cost 


. What equipment do you plan to buy right after you leave the Army? 


I don’t plan to buy equipment 
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(Answer all questions from here on.) 


59. 


61. 


62 


63. 


65. 


66. 


At the present time, how much money do you have saved or invested (in cash, 
bonds, etc.) that you can use to help get started back in civilian life after your dis- 
charge? Do not count your mustering-out pay. (Check one) 


1 Less than $100 

2 $100 up to $200 

3 $200 up to $500 

4 $500 up to $1,000 
5 $1,000 up to $1,500 
6 $1,500 up to $2,000 
7 $2,000 up to $5,000 
8 $5,000 or more 


. How clear an idea do you have of what is in the ‘GI Bill of Rights’? (Check one) 


1 Very clear idea 

2 Fairly clear idea 

3 _____ Not such a clear idea 

4 Not at all clear about it 


Do you think you will try to borrow money under the provisions of the GI Bill of 
Rights after you get out of the Army? (Check one) 


1 Yes, I am quite sure I will try to 

2 I may, but I’m not sure 

3 No, I don’t think so 

4 I don’t know enough about the GI Bill to say 


If you think you may borrow money under the provisions of the GI Bill, for what 
purpose would you want to borrow it? 


I don’t plan to borrow under the GI Bill 


About how much do you plan to borrow? (Check one) 
About $ 

I’m not sure how much I will borrow 

I don’t plan to borrow 


. Do you now own a home in your own name? (Check one) 


1 Yes 
2 No 


What living arrangements do you expect to have right after you leave the Army? 
(Check one) 


1 I expect to live in the home of my parents (or my wife’s parents) 
2 I expect to live in the home of other relatives or friends 

3 I expect to live in a rooming or boarding house 

4 T expect to rent an apartment 

5 I expect to rent a house 

6 _____ I expect to live in my own home 

7 Other. What? 


After you have gotten settled in your work, what kind of living arrangements do 
you expect to have? (Check one) 
1 I expect to live in the home of my parents (or my wife’s parenta) 
I expect to live in the home of other relatives or friends 
I expect to live in a rooming or boarding house 
I expect to rent an apartment 
I expect to rent a house 
I expect to live in my own home 
Other. What? 


2 
3 
4 
5 
6 
7 
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67. After you have gotten settled in your work, how much rent or home payment do 


you think you will be able to pay per month? (Check one) 
1 Under $10.00 per month 


2 $10.00 to $14.99 

3_____ 15.00 to 19.99 

4 20.00 to 24.99 

5 25.00 to 29.99 

6 30.00 to 39.99 

7. 40.00 to 49.99 

8 50.00 to 59.99 

9 60.00 or more 
ll I don’t plan to pay rent or home payment 


68. In this questionnaire we have asked you a lot of questions about your plans for 
after your discharge. Now that you have thought them over a little more, try to 
take everything into consideration and check the one thing you will most probably 


69. 


70. 


71. 


72. 


do first of all—after you have gotten your discharge and taken a vacation. 
1 


came into the Army 


I will probably work for the employer I was working for just before I 


2 I will probably work for salary, wages, or commission for some other 
employer 

3 —___ I will probably go to full-time school or college 

4______ I will probably farm for myself or with my family 

5 I will probably go into business for myself 

6 I am undecided what I will do 


How sure do you feel that this is what you will try to do first of all? (Check one) 


1 I am very sure 

2 I am pretty sure, but might do something else 

3 —_____. Chances are about 50-50 that this is what I will try to do 
4 I’m really not at all sure what I will do 


About how long & vacation do you plan to take before you get down to doing what- 


ever you expect to do? (Check one) 


1 T don’t expect to take any vacation 
2 Less than two weeks 

3 Two weeks up to one month 

4 One month up to two months 

5 Two months up to three months 

6 Three months up to six months 


7 ____ Six months up to a year 
8 A year or more 


Do you plan to go back to live in the same place—the same local area, say within 


5 or 10 miles of where you lived before you entered the Army? (Check one) 


1 Yes, I plan to live there for a long time to come 

2 Yes, I plan to go back there, but only for a short time 

3 No 

4 Undecided 
What kind of a place do you plan to live in right after you leave the Army? (Check 
one) 

1 On a farm 

2 In the country but not on a farm 

3 Tn a village of less than 2,500 people 

4 In a small city (2,500 to 25,000 people) 

5 In a city of 25,000 to 100,000 people 

6 In a large city (more than 100,000 people) 
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73. How many points do you have under the Army Point System—your points for 
length of service, combat, children, etc., under the Score Card Plan? 


Thave _.__SssssssS—S—CSCSsépintz 
I don’t know how many points I have 


Appendix B. Questionnaire Used for Follow-up of July Separatees 


(Please read every question through before you make your answer. Mark some answer to 
every question. If any additional information is needed to explain your answer, please 
write it in.) 

(You need nol sign your name on this questionnaire unless you want to.) 


1. Which of these things are you doing at the present time? (Check the answer or an- 
swers that apply to you—like this \/) 
1 Working for salary, wages or commission 


2 Working in your own business (not being paid a salary by anyone else) 
3 Farming—either for yourself, with your family, or for wages 

4 Going to full-time school 

5 Just resting, visiting friends, etc. 

6 Looking for a job 


7 Something else. What? 


2. How many jobs have you had since you were discharged from the Army? (Don’t 
count it as a different job unless you left one company or employer and went to work for 
another company or employer. If you have done different kinds of work for the same 
company or employer without ever leaving, just count that as one job.) 


1 I have not taken a job yet 

2 _____ I have had only one job 

3 I have had two jobs 

4 I have had three or more jobs 


3. If you are working, what kind of work are you doing now? (Please tell what your 
job 78 called and just what you do.) 
I am not working 


4. About how long were you out of the Army before you started working? 
months and _______. days 


I have not taken a job yet 


5. Are you planning to keep the job you now have? (Check one) 
1 I am not working 


2 Yes, I plan to keep the job I now have, if I can 
3 ______ I will probably keep this job for a while, but may change jobs 
4 No, I plan to leave the job I now have 

If you plan to leave the job you now have, what will you do? 


6. Since you got your discharge, did you go back to work for the employer (company, 
person, etc.) you worked for before you entered the Army? (Check one) 
1 I wasn’t working for an employer before I went into the Army 
2 Yes, I got a job with my old employer and I am working there now 


bee | 
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3 _____ Yes, I got a job with my old employer but I am not working there any 
more 

4 I tried to get a job with my old employer, but couldn’t 

5 No, I asked about a job but decided not to work there 

6 No, I haven’t started working there yet, but I plan to go back with 
my old employer 

7 No, I didn’t try to go back and I don’t plan to . 


. (a) Have you tried to get a job with any branch of the government (Federal, state, 


city, etc.) since you got out of the Army? 


1 Yes, and I am already working for a branch of the government 

2 Yes, I have my application in for a government job and will take 
the job if they offer it to me 

3 I did apply for a government job, but am not sure I want it now 

4 No, I haven’t tried yet, but I plan to 

5 No, I am not interested in a government job 


(b) If you have tried to get a government job, what job did you apply for and 
what did you do to try to get the job? 


. (a) Do you plan to go to full-time school or college? (Check one) 


1 I am going to full-time school or college now 

2 Yes, I plan to go to full-time school or college 

3 ______ I may go to full-time school or college, but I’m not sure 
4 No, I don’t plan to go to full-time school or college 


(b) What school or college will you go to? 


(c) When do you plan to start school or college? 
Month ____+_ seer 


. If you are farming, whose land are you farming on? (Check one) 


1 I am not farming 

2 My family owns or rents the land and I am farming with them 
3 I own the land in my own name 

4 I am renting the land 

5 I am working on someone else’s farm for wages 


Have you made any arrangements for buying or starting a business of your own? 
1 No, I am not planning to have a business of my own 


2 I already have a business of my own 

3 I have been thinking of getting my own business, but have not made 
any arrangements yet 

4 


Yes, I have made certain arrangements for getting a business of my 
own. What arrangements? ~~ — sSSsSCssSCCCCF 


If you were planning to have your own business, have you run into any difficulties 
you had not expected while you were in the Army? 


1 I wasn’t planning to have my own business 
2 No, I have not run into any difficulties I hadn’t expected 
3 Yes, I have run into some difficulties. What difficulttes? 
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12. 


13. 


14. 


15. 


16. 


If you are stil] planning to have a business of your own, when do you plan to get 
started? (Just a rough estimate) 

Month 25 Ver a 
I am not planning to have a business of my own 


(a) Have you applied for any of the benefits of the GI Bill of Rights? (Check one) 
1 Yes 
2 No, but I plan to apply 
3 No, and I don’t plan to apply 
(b) If you have applied, check the benefits you applied for: 
1 Loan to buy a house 


2 Loan to go into business 
3 Loan to buy s farm 
4 Approval of aid for education or training 
Unemployment payments 
(c) Did you receive the aid you applied for? 
1 Yes 
2 T haven’t heard yet 
3 No, my application was turned down 


Before you were discharged from the Army, you probably had some plans about the 
kind of work you would do right after you got out of the Army. Have you been 
able to carry out the plans you had? (Check one) 

1 Yes, I have carried out the plans I had 
: I haven’t tried to carry out my plans yet but still expect to 
4 
5 


No, I tried to carry out my plans but haven’t been able to 
I changed my plans before I really tried to carry them out 


I really didn’t have any definite plans 
In what way have you not been able to carry out your plans, and why was this? 


If you would care to make any suggestions to help the Army do a better job at the 
Separation Centers, or if you have any comments about problems you ran into that 
other soldiers should be told about, please write them here. 


PLEASE CHECK BACK TO SEE THAT YOU DID NOT SKIP ANY QUESTIONS 


APPENDIX 


SOME NOTES ON SAMPLING AND 
QUESTIONNAIRE ADMINISTRATION BY 
THE RESEARCH BRANCH’ 


HE following pages sketch briefly some of the problems involved in sample 
 Teeclestion and questionnaire administration by the Research Branch in its 
operations during the war. To the procedures most commonly used and here de- 
scribed there were many exceptions necessitated by special types of inquiries. 
For example, the experimental studies as described in Volume III, though not 
involving large cross-section samples, required much more meticulous planning in 
handling the situations of questionnaire administration than was necessary in the 
average survey. 

The Research Branch made no new contributions to sampling theory, other 
than the development of a few new formulas for calculating standard errors with 
certain types of data.* The problem of securing a sufficiently representative 
sample of the Army universe covered by a particular study was not an easy one, 
but its solution did not lead to new mathematical constructs. Because the situ- 
ations in which sampling was done in the Army were quite different from any 
which would be encountered in civilian life, it has seemed unlikely that the Re- 
search Branch experience in sampling would prove especially instructive for future 
civilian research. Therefore, the description of the problems encountered and 
methods used is kept very brief here and is presented mainly as a service to those 
readers of preceding volumes who wish to know a little more about the kind of 
operations which lay behind the tables and charts presented there. 

To many who worked in the Research Branch it soon became evident that error 
or bias attributable to sampling and to methods of questionnaire administration 
were relatively small as compared with other types of variation—especially vari- 
ation attributable to different ways of wording questions.? 

In the practical situation in which the Research Branch operated it was neces- 
sary to develop sampling procedures which were (1) swift, (2) economical, and 


1 By Samuel A. Stouffer. Acknowledgment is due to A. J. Jaffe for providing the 
principal data on sampling, to F. Douglas Williams for a memorandum on field work 
from which quotations are made, and to Shirley A. Star for data bearing on special 
methodological problems involved in studying Negro troops. 

2 For example, some simple formulas for testing significance in the case of order statis- 
tics due to Frederick Mosteller. See Volume I, Chapter 4, pp. 156 and 203. Another 
example is a formula for testing the significance of the difference between two correlated 
propositions, developed in the Experimental Section of the Research Branch. See 
Volume ITI, Appendix A; also Volume I, Chapter 4, p. 208. 

3 See, for example, Volume I, pp. 44-46. 
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(3) seemed to be within the limits of error tolerable for the particular study in- 
volved. 

The sampling designs in the initial studies made by the Research Branch were 
developed mainly by Rensis Likert, Quinn McNemar, and S. A. Stouffer, the first 
committee called in by General Osborn to advise on research procedures. As the 
scope of studies increased, a permanent sampling section was set up, headed by 
A. J. Jaffe. When Jaffe went overseas he was succeeded by George Hausknecht 
and later by Robert Dubin, and upon his return from overseas resumed responsi- 
bility. Perhaps no one contributed more to the basic philosophy of the Research 
Branch sampling than Frederick Mosteller, who was consultant on sampling 
throughout the period when the Branch was shifting over from local planning sur- 
veys, exclusively, to cross-section surveys as well. Louis Guttman also helped in 
this field, as well as in scale construction, and was responsible for the sample design 
in several studies. 

Although—perhaps one should say because—several of these men were expert 
statisticians, they kept their eye on practicalities. If it didn’t matter too much 
whether a 5 per cent bias appeared in the data, there was no point in dragging out 
a study several weeks or doubling or trebling the man power involved on it in order 
to achieve greater sampling precision. In fact, as experience grew, there was in- 
creasing confidence in the feasibility of ‘cutting corners” in sampling with no seri- 
ous loss to the ultimate reliability of the findings. On many topics the correlations 
between attitudes and those “demographic” characteristics which could be con- 
trolled by stratified sampling were so low that a sampling with large known bias 
in “‘demographic’’ characteristics would tend to yield about the same attitude re- 
sponses ag one with only a small bias. On other topics more caution was neces- 
sary. Consequently, the sampling design, with the accumulation of experience, 
was tailored more and more closely to the type of attitude data involved in a given 
study. 

Ordinarily, sampling was a two-step process. First, there was sampling of or- 
ganizations. This selection of sample points can be described as purposive or 
quota sampling. Second, within an organization there was sampling of individuals. 
The selection of individuals at a given sample point ideally involved procedures 
which might be called probability sampling. 

In the earliest studies and in many of the later studies, the universe to be sampled 
consisted of a single organization, such as an Infantry division. That particular 
organization may have been selected for one of many reasons, which did not neces- 
sarily involve application of any explicit sampling principles. For example, a 
study was made in November 1943 of the First Infantry Division, selected because 
it was the only division with combat experience in the European theater at the time. 
Later, similar studies were made of three veteran combat divisions in the Pacific 
which were selected personally for study by the Chief of Staff of the Army. Obvi- 
ously, generalizations from such studies to other divisions must be made with 
greater caution than if the data had been based on a random or stratified sampling 
of many units from all veteran combat divisions throughout the Army. But 
within a given division or organization the special Army situation offered facilities 
for probability sampling which often surpassed those in civilian life. If one wanted 
to get a representative sample of a particular organization, that was a relatively 
simple matter—in the ideal case. 


Sampling of Individuals within a Unit 


Let us discuss, first, the process of sampling individuals within a designated or- 
ganization, and defer the discussion of selection of organizations. 
Assume that the study design called for, say, 250 men in a particular organization. 
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The standard procedure was to draw the names of every nth man from a duty 
roster. The men whose names were drawn would then be ordered to appear at 
designated places, like mess halls, at designated hours, for administration of the 
questionnaire in groups of 25 to 50. In the ideal case, this situation was a sam- 
pler’s dream. 

Or, if the study design called for n. men with a high school or college education 
and vm: without, or for n: with high AGCT scores and na with low scores, or for cer- 
tain numbers at a given rank or with a given age, the problem was only slightly 
more difficult. At the personnel office was a file of the soldiers’ Form 20 cards 
containing all such information and much else. For any desired category of men 
these cards could be sampled at random and the men whose names were drawn 
could be ordered to appear at a designated time and place. 

Thus, ideally, the military situation was almost perfect for sampling within 
units. But only in the ideal case. In practice, the ideal was never quite realized 
and frequently the local situation was such as to require improvisation on the part 
of the field team. 

In order to appreciate the problem as it appeared in practice, one must under- 
stand the nature of the authority under which the Research Branch operated. 
General Osborn had full authority to make surveys, but he or officers under him 
could not give orders to commanders in the field. Whether a local commander 
could flatly refuse to cooperate was a problem which was never put to a definitive 
test. Almost always, when the Research officers approached a commanding officer 
at a post or with a unit, cooperation was forthcoming, sometimes grudgingly, but 
nevertheless it was given. Tact, dignity, and persuasiveness were often required 
on the part of the research officer who headed a team, since the commanding officer 
very often would suggest modifications of the sampling design, in the interest of 
minimum disruption of his troops’ time schedule. In a few instances where com- 
manding officers initially refused cooperation, the research officer, after all else 
failed, was able to secure cooperation by using words to this effect: “I am sorry, 
General X, but your units were selected in the Army cross-section sample and I 
have no authority to make substitutions. I can only call the War Department 
and tell them that you find it impossible to cooperate and await further instruc- 
tions from Washington. Incidentally, this will be the first time in our experience 
that this has been necessary.” Such a statement was usually enough, for the 
average commanding officer did not want to risk unknown trouble in Washington. 

There were times, however, when the commanding officer had very compelling 
reasons either for not complying or for suggesting modifications in the sampling 
scheme or the proposed time schedule. For example, the organization might be 
out in the field on a tactical problem, scattered over a wide area in small units. 
To assemble a sample of men at the Research Branch’s convenience might com- 
pletely disrupt the training schedule. The Research Branch field team was in- 
structed to make every reasonable effort to adapt its own time schedule to the 
local situation. Sometimes this meant returning to the camp a week later. Some- 
times the time urgency in Washington was such that it was now or never. In 
this case, some discretion was permitted. If there was an a priori reason to think 
that a new unit could be substituted for the unavailable unit without bias, a sub- 
stitution was made. 

Obviously, the Research officer had to be tactful, resourceful, and conscientious 
as well as thoroughly aware of the hundred and one pitfalls which could produce 
bias in the sample he was sent to get. Here is a concrete example of a situation 
calling for a combination of all these qualities: 


4 From a memorandum by F. Douglas Williams. 
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A Research Branch in an overseas theater agreed to undertake a survey—for a theater 
staff section—among casuals in a Replacement Depot awaiting shipment to the United 
States. It was further agreed that the sample would be a cross section of returnees 
representative of the theater, according to command, or a set of subsamples which could 
be correctly weighted to constitute such a cross section. 

In talking with the adjutant of the Depot, the field officer was told that if every nth 
man were to be taken from a shipping list of men who had been processed, and were 
now awaiting ships, a thoroughly representative cross section of the Depot would be 
obtained—and, further, he knew men from all commands m the theater would be repre- 
sented in approximately their proportionate strength. 

This looked good, on paper. Investigation indicated, however, that the main inter- 
est of the adjutant in this system was that the survey would be restricted to men whose 
processing had been completed, and therefore the Depot procedure wouldn’t be upset. 
Further investigation showed that the number and kind of men, and the command from 
which they came, present in the Depot at any given time varied considerably. Because 
of transportation procedure and scheduling of troops to the Depot, any shipping list 
might have been made up primarily of Ground Force men, or Air Force men, etc. 

It thus became apparent that a sample from the whole depot (including men who 
had not been processed) would have to be drawn, if there were to be even a chance at 
all that the various commands in the theater would each be adequately represented. 

The next step was for the Depot adjutant to suggest that he himself call each bat- 
talion commanding officer, and instruct him to order each of his company officers to 
select every nth man (the of course to depend on the company strength) from a roster 
of the men who would be available at the time of the class. He said some men wouldn't 
be available because of being on KP, on guard, on pass, etc., but that all men received 
these things on the same rotational basis, in which there was no selective factor. 

The field officer persuaded the adjutant that he could save the latter a lot of time 
and trouble if he (the field officer) visited the battalions himself, and made his own 
arrangements. 

Before he was through, the field officer had to go to the companies in the battalions, 
in order to decide on the sampling method that could apply there. The headquarters 
of the battalions claimed to know, but didn’t always, just what kind of a situation ex- 
isted in their different companies. Even companies in the same battalion had different 
practices. 

For instance, some kept up-to-date rosters, and some did not. This made a difference 
as to whether every nth man was selected from a paper record, or from an actual forma- 
tion of the men. 

In some companies all men drew details alike. In other companies sergeants drew 
no details, or only one or two particular kinds. In some companies noncoms received 
more passes than privates; in other companies this wasn’t true. Factors like this made 
a difference in whether the sample could be selected from the men with no other pre- 
arranged occupation (such as guard or KP) at the time of the questionnaire, or whether 
the sample would have to be drawn from all men in the company, in order that it be 
representative. 

By this time another factor had become apparent. There were no adequate records 
in Depot, battalion, or company headquarters which could show the extent to which 
the various theater commands were represented. The Depot had a fluid, swiftly 
changing population, and their paper work wasn’t designed to keep track of matters 
like this. There was no basis for stratifying even approximately by command, or 
stratifying to get a minimum number of men from certain commands. 

Accordingly, the field officer decided to get a representative crogs section of the 
Depot as of a particular time period. Then, at least, the study could be reported on 
that basis. He also increased the size of the total sample, with the hope that then even 
the commands with the smallest number of cases would be large enough to permit com- 
paring them with the commands having larger representation. (Weighting could of 
course be used to arrive at theater averages.) Happily, his hope was borne out in prac- 
tice. 
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Fortunately, most situations were not as complicated as this, but seldom was a 
survey conducted which did not involve problems. Inexperienced Research offi- 
cers learned rapidly after happenings like the following: 


A staff officer in an organization overseas who was very cooperative was permitted, 
contrary to Research Branch instructions, to select the nth men himself from the rosters. 
When he started to check the names he found that one third of the unit’s strength was 
away atarestcamp. So he just substituted names at will. The research officer learned 
of this after the men had filled out the questionnaires. He wasn’t in a position to find 
out much about the basis on which the staff officer had substituted names, and, further, 
the sample couldn’t have been good with that many men absent (selective factors were 
present in determining which men from the unit had been given furloughs). All the 
Research officer could do was discard the questionnaires and substitute another unit. 

In another case, the carrying out of a sampling procedure consisted of a battalion 
commanding officer instructing his adjutant to inform all the company commanders to 
check every nth name off a roster, so that such and such a number of men could be 
selected. The Research Branch officer was too far removed from this operation to 
control it, and he later found out that in practice the prescribed number of men were 
just arbitrarily picked from some of the companies and told to report for the question- 
naire. Again, the questionnaires from that battalion had to be discarded, and another 
organization substituted for it. 

In the third case, the field officer wanted the total strength of a unit for his sample. 
The questionnaire was a short one, and the executive officer convinced the research 
officer that a special order or announcement for a particular time for the men to take 
the questionnaire would not be necessary. Rather, it could be administered during the 
Orientation Hour, which all men always attended. Somewhat more than a half of the 
men turned out to be actually present, and accordingly a biased and insufficient sample 
resulted. The completed questionnaires were of course useless from this unit, and they 
were eventually discarded and a new unit substituted. 


An illustration of the way in which quotas at a particular installation were as- 
signed and the extent to which these quotas were filled is given in Table 1. Camp 
Stewart, Georgia, was one of seventeen installations visited in February 1944, in 
connection with Survey 95. The specific units at Camp Stewart were preselected 
in Washington and quotas assigned as shown in the first column of Table 1, indi- 
cating the number of men to be called. Except in the case of the ist Provisional 
Cadre, the men were to be selected from rosters by the nth man procedure; allow- 
ance was made for about 20 per cent absentees, due to leaves, hospitalization, 
etc., as shown by the expected numbers in the second column.* (In the case of 
the 1st Provisional Cadre, since it was desired to get overseas returnees from speci- 
fied places, special lists of available men were to be drawn up at the camp and 
these lists sampled randomly.) As often happened, the information in Washington 
proved to be out of date. The 561st Anti-Aircraft Battalion had shipped out and 
the Research Branch officer exercised his discretion in substituting the 580th. 
However, the 82nd Finance Disbursing Section also had shipped out and no sub- 
stitute unit was available at Camp Stewart. The Research officer promptly noti- 
fied Washington, which added a comparable unit to the quota of another camp. 

It will be seen from the third column that the number actually reporting was a 
little below estimates in the anti-aircraft battalions. In the case of the Service 


5 From Williams, loc. cit. 

6 Except in special instances, there was no reason to think that a normal percentage 
of absentees on leave introduced a serious sampling bias. Omission of those in the 
guardhouse did represent bias, of course, but such men were few in number. Omission 
of those hospitalized could introduce bias, especially if the complaint was psychoneurotic, 
but these cases of absentees from the roster also were relatively few. 
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Command units the number reporting was above expectation, partly because the 
Research Branch officer interpreted his instructions as permitting calling some- 
what more than 45 or 80 names originally. Table 1 is used illustratively because 
it shows quite typical situations—neither better nor worse than what usually was 
found. 

There were frequent studies requiring special samples which involved drawing 
the respondents from the Form 20 cards, which contained background information 
on the men. For example, a study might require an extra sample of college men 
only or of men with particular job skills or training. It was then necessary to sort 
the Form 20 cards and draw the sample randomly from subsets of these cards in- 


TABLE 1 


ASSIGNED Quotas AND ExrTent oF FULFILLMENT, Camp STEWART, Ga. 
(February 1944) 


Average 
Number number Number 
to be expected actually 
called to report reporting 
Anti-Aircraft Training Command 
297th Anti-Aircraft Battalion 93 74 67 
298th Anti-Aircraft Battalion 89 71 59 
*561st Anti-Aircraft Battalion 95 76 69 
589th Anti-Aircraft Battalion 111 89 75 
590th Anti-Aircraft Battalion 97 68 63 
Service Command Units 
1474 MP Detachment About 45 36 45 
1474 Supply Detachment About 80 54 80 
82nd Finance Disbursing Sect. 55 44 ore 
Overseas Returnees 
910th Anti-Aircraft Battalion 70 56 68 
1st Provisional Cadre 
Men from Caribbean 25 25 27 
Men from Iceland-Greenland 25 25 25 
Men from Africa-Sicily-Italy 25 25 20 


* Shipped out. 580th Anti-Airoraft Battalion substituted. 
™ Shipped out. No comparable unit present. Hence no cases obtained. 


stead of from rosters. Wherever a unit, as was almost invariably the case with 
Negro units, was expected to contain a substantial number of illiterates or semi- 
literates, it was necessary to use the Form 20 cards instead of rosters. Thus the 
men to be personally interviewed could be segregated in advance and sent to spe- 
cial points where trained interviewers were awaiting them. Problems involved in 
personal interviewing will be discussed later in this Appendix. 
Selection of Organizations To Be Included in a Study 

In spite of ali the difficulties encountered in the field, the Army situation usu- 
ally made possible the obtaining of a satisfactory random sample of individuals 
within the units preselected. The greatest chance for bias was in the initial selec- 


tion of the units. 
As has been indicated, the preselection was done in Washington for troops in 
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the United States or at theater headquarters for troops in a given theater overseas. 
The Research Branch had access to the latest secret data on troop strength and 
location of units, although in some rapidly moving periods the information was out 
of date by the time the study went into the field. 

The selection of units involved purposive rather than strictly random sampling. 
If a cross section of enlisted men in the United States or a theater was required, 
the standard procedure was to select the units such that each branch of service 
was represented in as nearly as possible the correct proportions. Within a given 
branch of service, effort was made to get units in various stages of training or with 
various types of Army experience in their correct proportions, in so far as this 
could be inferred from available data. For example, among infantrymen in the 
United States, some would be in Replacement Training Centers, some in newly 
activated divisions, some in divisions scheduled for early shipment overseas. The 
proportions of men in each type of unit could be roughly calculated, and the sample 
was designed to maintain these proportions. 

Having stratified the universe by branch and type of unit, one could proceed to 
draw one or more units from each subcell of this stratified universe. Ideally, this 
drawing would be at random, but practical considerations usually prevented a 
strictly random selection of units within the subcell. Most important was the 
fact that the number of field representatives of the Research Branch, either at 
home or overseas, was limited. Therefore, it was economical and, in fact, neces- 
sary, to limit severely the number of posts to be visited. It was much faster to 
get a variety of units at each of a few posts than one unit each at a large number 
of posts. The number of posts visited depended on the character of the study. 
If the study called for data which would be highly correlated with post location or 
with particular post policies, many more posts would be needed than if the study 
called for data on, say, attitudes toward our allies or desire for overseas service. 
Usually, not more than a dozen or twenty posts were visited on a particular cross- 
section survey in the United States, and several different kinds of units might be 
sampled at each post. 

The need for speed and economy led to a systematic bias in favor of selecting the 
larger posts rather than installations with only a few men. Statistical analyses of 
attitude data showed that there were no differences in most attitudes as between 
men in the two kinds of installations, except on occasional topics like leisure-time 
activities. Ifa survey emphasized the latter, for example, special effort was made 
to get an adequate sample of smaller installations. 

Another practical consideration was the necessity of routing the field teams 
so that a given team could cover a maximum number of installations in a mini- 
mum time. Thus one team would go to Texas, Louisiana, and Mississippi; another 
to Alabama, Georgia, and Florida; another to Oregon and California; another to 
the Carolinas and Virginia; another to Indiana, Kentucky, and Tennessee; another 
to New England; etc. Railroad and air schedules actually were consulted before 
the final decision on camps was made. 

Finally, since sometimes several surveys had to be made in quick succession, 
explicit effort was made to avoid bothering any one installation or any one unit 
within that installation any oftener than necessary. 

When all these considerations were taken into account, it can be seen that a 
large element of purposiveness rather than randomness entered into the selection 
of installations to be visited. 


Checking the Adequacy of the Sample after It Was Taken 


After each survey, the sampling experts checked the returns to see whether 
large biases had occurred on variables for which data were available for the uni- 
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TABLE 2 


CoMPARISONS OF DEMOGRAPHIC CHARACTERISTICS IN RESEARCH BRANCH 
SampLep aND IN AGO 2 Par Cenr SamMp_s or ARMY 


(Percentage Distribution) 
UNITED STATES OVERSEAS TOTAL 
Research Research Research 
Branch AGO Branch AGO Branch AGO 
Rank 
Sergeant 28 26 30 28 29 27 
Corporal 18 16 22 23 21 21 
Pfe 21 21 36 34 31 30 
Private 33 37 12 15 19 22 
100 100 100 100 100 100 
Marital Condition 
Single 49 52 61 62 58 60 
Married, no children 25 25 20 21 22 22 
Married, 1 child 16 14 13 12 13 12 
Married, 2 children 7 6 4 3 5 4 
3 or more children 3 3 2 2 2 2 
100 100 100 100 100 100 
Length of Time in Army 
Over 3 years 27 22 29 23 28 22 
2 years up to 3 years 35 37 46 48 43 45 
1 year up to 2 years 14 18 18 22 17 21 
Under 1 year 24 23 7 7 12 12 
100 100 100 100 100 100 
Length of Time Overseas 
Over 3 years 2 1 2 1 2 1 
2 years up to 3 years 7 7 13 13 11 10 
1 year up to 2 years 8 8 35 39 25 29 
6 months up to 1 year 4 5 24 24 18 19 
Under 6 months 1 3 26 23 18 7 
Never overseas 78 76 — _— 26 24 
100 100 100 100 100 100 
Age 
35 and over 12 13 8 10 9 1 
30-34 17 17 16 17 16 7 
25-29 31 31 31 33 31 33 
20-24 26 29 39 38 35 35 
Under 20 14 10 6 2 9 4 
100 100 100 100 100 100 
Number of cases in Research 
Branch sample 2,417 4,886 7,803 


Research Branch sample, 8-205, May 1945. AGO sample, comprising sbout 2 per cent of entire Army, 
March, 1945, 
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verse—such as rank, length of time in the Army, age, etc. In a few instances, the 
sample revealed a bias—not including quite enough men in the early stages of 
training, for example, or including too many. When feasible, an additional sam- 
ple was drawn immediately to correct such a bias. 

In checking the reliability of its samples, the Research Branch was severely 
hampered, especially in the early years of the war, by the absence of up-to-date 
demographic data on the universe. From the beginning, the Branch had used all 
the influence it could muster to get the Army to adopt a more flexible bookkeeping 
system as to its personnel. Experts from the Branch were loaned to other Army 
agencies at various times to help achieve this. Especially helpful was a periodic 
2 per cent sampling of the entire Army instituted by the Machine Records Branch 
of the Adjutant General toward the end of the war. While this sample was never 
entirely complete, and had some biases which the Research Branch statisticians 
helped discover, it was invaluable in providing a check on some of the important 
demographic characteristics of Research Branch data such as length of time in the 
Army and length of time overseas. 

An example of the extent to which demographic characteristics as derived from 
a 2 per cent AGO sample taken as of March 1945 and a Research Branch cross 
section (S-205) as of May 1945 agreed, is shown in Table 2. In most character- 
istics, as Table 2 shows, the agreement is quite close, though some discrepancies are 
statistically significant. If we assume (as is not entirely justified) that the AGO 
sample is correct, there is probably no attitude item in which overall proportions 
of respondents in any attitude category would be changed by more than 1 or 2 per 
cent by reweighting attitude responses according to proportions in a given AGO 
sample category rather than according to proportions in the Research Branch 
sample. 


TABLE 3 


ScoRES UNDER THE Point System as RerorTep BY MEN IN ResEaRcH BRANcH 
Survey anp as Estimatep spy AGO From OrriciaL Recorps, May 1945 


(Percentage Distribution) 


UNITED STATES OVERSEAS TOTAL 
Research Research Research 
Branch AGO Branch AGO Branch AGO 
85 and over 9 8 14 ll 13 10 
75-84 6 4 ll 9 9 7 
65-74 5 5 16 13 12 10 
55-64 4 6 16 18 13 14 
45-54 9 9 16 19 13 16 
Under 45 67 68 27 30 40 43 
100 100 100 100 100 100 


Research Branch data from 8-205. 


Table 3 compares the distribution of point scores as reported by the men in the 
same Research Branch survey with a distribution of point scores as determined 
independently by the AGO from official records. 

Surveys throughout the war were checked routinely against available Army 
data to determine the extent of bias in the sampling. In view of the magnitude 
of the variations in response which could be induced by even small changes in the 
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wording of questions, the errors introduced by sampling were generally thought to 
be so small as to be relatively unimportant. 


Methods of Questionnaire Administration 


Another potential source of error was in the process of administration of the 
questionnaires. Normally, a sample of about fifty enlisted men were assembled 
by order of their unit commander at a place convenient for writing—a mess hall, 
for example. The class leader was almost always an enlisted man, trained for the 
purpose. His opening remarks would go somewhat as follows: 


You are going to be given some questions to answer. You can answer them just 
exactly the way you personally feel about them. There are no right or wrong answers. 

Believe it or not, the Army is interested in knowing how you personally feel about 
some things in the Army. And in order to be sure that you feel free to say what vou 
think just as frankly as you want to say it, these questionnaires are anonymous. Dw 
not sign your name or serial number. 

When you have finished, put your questionnaire in a pile on the table. Your CO 
will not see any of these questionnaires. 

You may have been wondering why you were selected for this. Well, out of the 
thousands of units in the Army a few units are pulled out of the hat, as it were, from 
the Army. What men in these outfits say will correspond very closely with what all 
the men in the Army will say. And we also have drawn men at random from the 
roster in your outfit. So your name came up by chance and here you are. 

These questionnaires will be sent to Washington (a theater headquarters if overseas) 
and tabulated in statistical machines. Nobody will ever be able to identify your ques- 
tionnaire. Here’s your chance to say what you really think! 


Qualitative evidence of validity of the men’s responses can be derived from in- 
spection of the filled-out questionnaires. With few exceptions, the hundreds of 
thousands of men throughout the world who filled out questionnaires took the task 
with obvious earnestness. There can be little doubt about their frankness. And 
the fact that responses both “hung together” internally and were correlated with 
outside criteria has been demonstrated in the many tables and charts presented in 
Volumes I and II. 

This is not to say that men were not skeptical as to the extent which the Army 
would pay attention to what they wrote. Some of them said quite frankly that 
they didn’t expect the Army to do anything about the situations covered in the 
questions; but such cynicism did not inhibit them from answering, and, indeed, 
from elaborating their remarks in very detailed free comments written on the mar- 
gins of the questionnaires as well as in spaces provided for such comments. 

Several studies were conducted to test explicitly the variations in response intro- 
duced by various methods of questionnaire administration. Since the Army 
situation has few parallels with that in civilian interviewing, which ordinarily can- 
not take advantage of the economies of “‘classroom”’ administration, these studies 
will be reviewed only in brief summary. 

Before the first planning survey was made in December 1941, a methodological 
study was conducted to compare classroom administration and personal inter- 


7 This was rigorously adhered to. An incident which became something of a classic 
story in the Research Branch occurred early in the war. The commanding general of 
a post walked into a hall near the end of a session and started leafing through the ques- 
tionnaires which the early finishers had already tossed on the table. The corporal in 
charge of the table said, “Sir, my orders are that no officer in this camp will be permitted 
to see these questionnaires.’ The general immediately apologized and, returning to 
headquarters, commended the officer in charge of the Research Branch team for having 
an enlisted man with such a sense of duty. 
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viewing. This study, made in November 1941, was designed as follows: 

Group A filled out the questionnaire in the classroom. 

Group B was interviewed personally and individually by trained enlisted men, 
using the same questionnaire. 

Group C filled out the first half of the questionnaire in the classroom and was 
personally interviewed on the second half. 

Group D was interviewed on the first half of the questionnaire and filled out the 
second half in the classroom. 

Each group comprised fifty cases. No significant differences were found be- 
tween any of the groups. Moreover, the respective halves of split questionnaires 
contained questions with parallel forms and the differences between responses to 
parallel forms by the same persons were not significant. The only apparent dif- 
ference between classroom administration and interviewing results was the fact 


TABLE 4 


Reiation or AGCT, Epucation, anp Race to NONRESPONSE ON SURVEYS, 
Cuass-ADMINISTERED QUESTIONNAIRES COMPARED WITH PERSONAL INTERVIEWS 


(February 1943) , 


AVERAGB PERCENTAGE OF 


DESCRIPTION OF GROUP NUMBER OF MEN UNANSWERED QUESTIONS* 
AGCT 
Race Level Education Classes Interviews Classes Interviews 
White I-IV — 217 —_ 2.0% — 
Negro ‘I-III _ 96 _ 1.0 _ 
Negro IV _ 82 _ 2.0 _ 
Negro Vv 8th grade . 47 74 2.5 1.5% 
or more 
Negro Vv 4th-7th 94 98 2.9 1.4 
grade 
Negro Vs Under 4th 86 73 8.6 2.1 
grade 


* This figure is the total number of nonresponses divided by the product of the number of men surveyed 
and the number of questions asked. There were 59 background and attitude items common to the white 
and Negro questionnaires. The Negro questionnaire contained an additional 15 attitude items as well. 


that more detailed free comments were written down by the men themselves than 
by their interviewers. 

In this part of the study men with Class V (lowest) AGCT scores were not in- 
cluded; it was thought that they would probably have to be interviewed personally 
in any case. However, a group of such men were assembled in a classroom and 
questions were read to them aloud, giving each man time to check the answers 
on a questionnaire in front of him. The results checked closely with those ob- 
tained by individual personal interview, but the oral classroom procedure was so 
cumbersome and time-consuming that it was never used in practice. 

In the first year of operations, all Class V men were interviewed routinely. 
Subsequent methodological studies showed that it was not necessary to interview 
the majority of such men as long as questions were simple in form. However, on 
complicated questions the results were better from personal interviewing. As 
time went on, and especially as pressure increased to get results quickly, the Re- 
search Branch tended to dispense with interviewing Class V men, except for a 
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few who when writing in the classroom had obvious difficulties in reading the 
questionnaire.® 

Negro soldiers, as was reported in Volume I, Chapter 10, constituted a special 
problem. So many of them not only were in AGCT Class V, but also had had a 
most meager education, that any study of Negroes necessarily involved a sub- 
stantial amount of personal interviewing. 

Two quite elaborate methodological studies of the problem of eliciting infor- 
mation from Negroes were made, in January 1943 and in February 1943. Using 
as the criterion the number of questions actually answered, it was found that 
Negroes in all but AGCT Class V with less than 4th grade education handled the 
classroom questionnaires satisfactorily. This is shown in Table 4, which compares 
whites in AGCT Classes I to IV with Negroes in various subgroups. In the Negro 


TABLE 5 


Responses BY Necro ENuistep Men From AGCT Crass IV 1n INTERVIEWS 
BY NeaGRozs aS COMPARED wiTH CLAssROOM SITUATION AND IN 
INTERVIEWS BY NEGROES AS COMPARED WITH INTERVIEWS BY 


Wairss 
Elicited by Negro Elicited by Negro 
interviewers as interviewers as 
compared with compared uth 
classroom situation white interviewers 
Exceas in percentage of responses indi- 
cating racial protest (average of 4 
questions) +1% +21% 
Excess in percentage of responses indi- 
cating low personal commitment 
(average of 4 questions) +2 +14 
Excess in percentage of responses indi- 
cating lack of enthusiasm for war 
aims (average of 8 questions) —-6 +8 
Excess in percentage of responses indi- 
cating pessimism about postwar con- 
ditions (average of 9 questions) —-3 +21 
Excess in percentage of responses indi- 
cating unfair treatment in the Army 
(average of 5 questions) +6 +16 
Excess in percentage of responses indi- 
cating lack of high regard for officers 
and noncoms (average of 3 questions) —4 +2 
Excess in percentage of responses indi- 
cating relatively low personal esprit 
or job satisfaction in the Army (aver- 
age of 13 questions) 0 +8 


* Since practically all soldiers could speak and read English, the foreign language 
problem was negligible. In one instance, in Panama, an entire questionnaire was pre- 
pared in Spanish for Puerto Rican troops. 
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sample, where interviews were used, there were Negro enlisted men trained for the 


purpose. 

Special attention was given to differences which might result in using Negro or 
white enlisted men as interviewers. A sample of 180 Negroes from AGCT Class 
IV, for example, was interviewed by Negroes; a sainple of 146 Negroes matched 
with the 130 on AGCT, education, and other variables, was interviewed by whites. 
A third sample of 140 Negroes was given the same questionnaire in a classroom 
situation with Negro class leaders. 

Table 5 shows that there was no essential difference in response between the 
results using Negro interviewers or classroom administration. But the differences 
in response to Negro and white interviewers, respectively, are quite consistently 
large. Negro interviewers tended more than white interviewers to elicit responses 


TABLE 6 


Proportion WriTING Freon Comments about THE Bus Servicr, CLASSIFIED 
BY Oprnion oF Bus Sz#rvics, EpucaTionaL Luvet, anp Race 


PROPORTION VOLUNTEERING COMMENTS AMONG THOSE 
WHO SAID BUS SERVICE WAS: 


RACE AND 
EDUCATIONAL LEVEL 
Very good Fairly good Poor Very poor 
White 
Grade school 12 (819) 16 (674) 27 (247) 39 (256) 
Some high school 21 (188) 26 (481) 41 (240) 468 (266) 
High school graduate 22 (161) 31 (490) 48 (296) 56 (#77) 
College 35 ( 91) 39 (226) 51 (125) 60 (1383) 
Negro 
Grade school 22 (660) 26 (890) 46 (842) 54 (1,299) 
Some high school 20 (116) 37 (286) 51 (814) 58 (784) 
High school graduate 24 ( 50) 44 (190) 62 (209) 70 (484) 
College 44 ( 87) 41 (108) 62 (181) 69 (894) 


8-32, March 1943. Cross sections of white and Negro enlisted men in the United States. Number of 
cases in parentheses. 
reflecting racial protest, unfavorable attitudes toward the war, pessimistic views 
of postwar conditions, unfavorable reports on the Army, and manifestations of low 
personal esprit. On the basis of such findings, the Research Branch routinely 
used Negro interviewers where interviewing was necessary.® 


Who Wrote the Most Free Comments? 


Studies made on many occasions brought out the fact that among all soldiers 
free comments tended to be made most frequently by (a) the better educated and 
(b) the most aggrieved with respect to a particular problem. This fact has been 
stressed frequently in Volumes I and ITI by way of counseling caution in the inter- 
pretation of free comments. Men were encouraged to write free comments, and, 
as already has been noted, the classroom situation was somewhat more productive 


* It is not claimed that the response given to whites is an “untrue” response and that 
given to other Negroes is a “‘true’’ response, even if the latter did not deviate far from 
that obtained in the anonymity of the classroom. Both kinds of responses could be 
relevant, one being a “‘public” response to a white man, and the other a “‘private’’ re- 
sponse. Since in general only one type could be obtained in an actual cross-section 
study, the Research Branch decided to use Negro interviewers and Negro class leaders. 
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of free comments than the personal interview situation. But the lesser educated 
are likely in either case to be less articulate than the better educated. How edu- 
cation and sense of grievance combine to elicit free comments is shown in Table 6. 
Among white soldiers who checked ‘“‘very poor’ in response to a question about 
the bus service, 60 per cent of the college men wrote comments as compared with 
39 per cent of the grade school men. Among those who checked ‘‘very good,” 
35 per cent of the college men wrote comments as compared with 12 per cent of 
the grade school men. Negroes, who had additional grievances against the bus 
service because it often was in fact poorer than for whites and because Jim Crow 
regulations were observed, were more likely than the whites to offer comments. 
This was true in 15 out of the 16 comparable categories in Table 6, by education 
and by opinion of the service. 


The files of the Research Branch in Washington contain much data which fur- 
ther illustrate or document points mentioned in this Appendix. These records 
may be consulted by accredited social scientists who seek further information on 
the methodological problems discussed here. 

Pressed as it was by the necessity of getting reasonably reliable information as 
quickly as possible, the Research Branch made many methodological compromises. 
Its sampling methods were by no means models of what sampling methods should 
be—although the Army situation was in some respects more favorable to the use of 
modern probability sampling methods (once the sample points were located) than 
is often possible in civilian life. Even more methodological studies would have 
been made to experiment with alternative interviewing situations, if time had per- 
mitted and personnel been available. As has been noted, however, the statisticians 
in the Branch were convinced that the biases introduced by sampling designs and 
their field execution were very small as compared with those attributable to the 
wording of questions. Hence, the concern of the Branch became more and more 
to improve its methods of question design—which !ed to the work on development 
of scaling theory occupying the first eleven chapters of the present volume, and to 
the analysis of outcomes of prediction using different types of items, as is described 
in Chapters 12 to 16. 

Progress in theory and research with respect to sampling and interviewing has 
been rapid since the war. In this work government agencies like the Bureau 
of the Census have played a leading role, along with academic and business re- 
search organizations. Forthcoming monographs prepared under the auspices of 
the Joint Committee of the National Research Council and the Social Science Re- 
search Council, on the Measurement of Opinions, Attitudes, and Consumer Wants, 
may be expected to add further to our knowledge. 
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Acceptance of soldier role: 529-530; cor- 
relation of with criterion, 497; defined, 
493, 529; discriminating value of, 479, 
495, 529; list of items for, 529-530; and 
other areas, 496, 497; as scale, 493; 
scores for normals and neurotics, 529 

Acceptance standards, for latent structure 
analysis, 23n, 429 

Accounting equations for latent structure 
analysis: 207, 369-371, 376, 379; equiva- 
lent finite, 377-378; as finite sums, 376- 
378; for general case, 369, 370; as inte- 
grals, 369-375; for linear trace lines, 
374; for point distributions, 377; for 
polynomial trace lines, 375; for quad- 
ratic trace lines, 371, 372; for trichoto- 
mous items, 472; for two underlying con- 
tinua, 453-454 

Accuracy: and external validity, 58, 59; 
of opinions, 59 

Achievement tests, 205 

Additive constant for regression equation, 
539, 544-545 

Adjusted frequencies in latent structure 
theory, see “Response patterns, fitted” 

Adjutant General’s Office, 716, 717 

Adler, Leta, 568n 

Administrative and physical rejection 
rate: for NSA scores, 562; for NSA 
scores and critical signs, 563 

Administrative rejection rate: 550; for in- 
duction stations and service commands, 
551-552 

Administrative rejects and NSA scores 
and critical signs, 566 

AGCT scores and interviews, and non- 
response on surveys, 719 

Age: and business ownership, 656; and 
postseparation business ownership, 662; 
and postseparation interest in education, 
617-618; and postseparation return to 
previous employer, 636; and postwar 
plans for business ownership, 655; and 
postwar plans to return to previous em- 
ployer, 629 


Age distribution: for normals and neu- 
rotics, 512; in U.S. and overseas, 716 
Aggressiveness, see “Childhood fighting 

behavior” 
AGO, see “‘Adjutant General’s Office” 
Agricultural workers, number required, 
640 


Agriculture: soldiers’ plans for, see ‘‘Post- 
war plans for farming”; vs. industry, 
642 

Aitken, A. C., 389n 

Alcoholics, screening of, 511n 

Allport, G. W., 47n, 575 

Analysis of variance, 173, 174, 194 

Andrews, T. G., 4n 

Anonymity and psychosomatic symptoms, 
504, 505 

Anxiety: see also “Psychosomatic com- 
plaints”; and neurosis, 534 

Appel, J. W., 486n 

Aptitude study, 61, 467 

Area of indifference, see ‘Zone of relative 
indifference’’ 

Areas of psychiatric problems: 479, 489; 
see also individual areas; and compari- 
son between cross section and neurotics, 
495; definitions of, 492-493; intercorre- 
lations between, 496, 497; item lists 
for, 514-538; and multiple regression 
weights, 498-499; as scales, 493-494 

Army: attitude toward, which see; concern 
of for personal welfare, attitude toward, 
158 

length of time in: (spring 1944), 570; 
and demobilization, 189; distribution 
for normals and neurotics, 513; distri- 
bution in U.S. and overseas, 716; and 
plans to return to previous employer, 
629; and postseparation return to pre- 
vious employer, 636 

and psychoneurotics, 490, 507 

Army averages at peak strength, and Dec. 
1945 separatees, 628 

Army career: attitude toward, which see; 
“desire” for and “plan” for, 135, 137, 
139 

Army General Classification Test, see 
“AGCT scores” 
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Army rules and regulations, attitude 
toward, 158 

Army Score Card Plan: attitude toward, 
which see; development of, 189-191; 
point scores, distribution of in U.S. and 
overseas, 717 

Army service: and educational plans, 587; 
meaning of, 572; and occupational plans, 
568, 571, 576, 579, 581,623, 640 

Athletes, attitude toward, 158 

“Attempt” to carry out plans, defined, 
586, 609, 659 

Attitude, definition of: 47-57; indirect ap- 
proaches to, 54-57; by Murphy, Mur- 
phy, and Newcomb, 62n (see “Items”’); 
role of prediction in, 50, 51 

Attitude: as a delimited totality of be- 
havior, 49-57; as an inference from pre- 
vious behavior, 49; and opinions, 47, 48, 
87; as a predisposition to act, 49-51, 53; 
unitary, 366 (see also ‘“‘Unidimensional- 
ity’’); “unstructured” and “structured,” 
82-83 

Attitude formation and change, 47n, 82, 
170 

Attitude measurement: 46-59, 60, 86, 
181n, 214; bibliography of, 4n; and 
item analysis, 181 

Attitude research: 47-59, 60, 61, 63, 80, 
86, 87, 88, 121, 150, 168, 169, 172, 173, 
296, 467; bibliography of, 4n, 47n; and 
intensity analysis, 43, 213-276 

Attitude scales, see “Scales,” “Scalogram 
analysis,” ‘Latent structure analysis” 

Attitude toward adequacy of training, 
and desire for combat, 150 

Attitude toward administration of the 
Army Score Card Plan: 236; and atti- 
tude toward the Army, intensity anal- 
ysis of, 243, 250, 251; content scores vs. 
intensity scores, 237, 250, 251; examples 
of content items and of intensity items, 
234, 259; intensity analysis of, 234, 236, 
237 

Attitude toward Allies, and latent struc- 
ture theory, 364, 367 

Attitude toward the Army: 233, 266, 273- 
274, 417-418; and attitude toward the 
administration of the Army Score Card 
Plan, 243, 250, 251; and biased items, 
intensity analysis of, 269-271, 273-274; 
for combat outfits and for technicians, 
in Air Force, 82, 83; content scores va. 
intensity scores, 230, 231, 271, 273; and 
generalized verbal intensity, 262-265; 
intensity analysis of, 230-231, 233 

latent dichotomy analysis of: 21-26, 

28, 29, 383, 417-431; computed latent 
structure for, 28, 427; empirical cross- 
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product matrix of, 419, 421; fitted cross- 
product matrix of, 421; fitted response 
pattern frequencies of, 22, 428; manifest 
data for, 22, 418; precision of, 431; total 
misfit of, 23 

list, of items for: 17, 21, 100-102, 229, 
259, 262, 264, 417-418; favorably biased, 
267-268; for intensity, 229, 259; unfa- 
vorably biased, 267 

of Negroes, for Negro and white in- 
terviewers, 720; and neurosis, 534; 
scalogram of, 118; scalogram analysis 
of, 16, 17, 100-121; scalogram of inten- 
sity of, 228 

Attitude toward: an Army career: list of 

items for, 137; in postwar plans pretest, 
583; scalogram of, 136; scalogram anal- 
ysis of, 133, 135, 137, 139, 141, 583 

the Army’s concern with personal wel- 
fare, 158; one’s Army job, see “‘Satisfac- 
tion with one’s Army job” 

Army rules and regulations: 158; ex- 
amples of items for, 259, 293 

Army Score Card Plan, 158, 234, 236- 
239, 243; athletes, 158 

the British: 214, 215; items for, 214, 
292, 402 

bus service, and race and educational 
level, 721-722; business ownership, 645- 
646, 665; capitalism, examples of items 
for, 459; children, with respect to de- 
mobilization, 189, 190 

civilian support of the war effort: 158; 
example of item, 293 

combat: list of items for, 149; and 
neurosis, 530, 531; scalogram of, 148; 
scalogram analysis of, 145, 149, 150 

the draft, and neurosis, 529; enter- 
tainers, 158 

future results of the war: 158; and 
neurosis, 528 

girls and neurosis, 516, 524-525; gov- 
ernmental employment, 668 

idea of the Army Score Card Plan: 
236; content scores vs. intensity scores, 
238; example of item, 236; intensity 
analysis of, 236, 238-239 

the Inspector-General system, item 
for, 259; internationalism, 158; mail 
censorship, 158; Negroes, methodologi- 
cal discussion of, 51, 52, 176, 177, 467; 
obeying Army rules and regulations, 158 


Attitude toward officers: 167, 168, 226, 


266, 274-275, 448, 452; and biased 
items, intensity analysis of, 269-270, 
272, 274-275; content scores vs. inten- 
sity scores, 223, 227, 272, 274; fold-over 
intensity analysis of, 253; intensity 
analysis of, 42, 222-227, 229 
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Attribute: as qualitative variable, 63, 172, 


latent turnover analysis of: 448-452; 
computed latent structure for, 452; em- 
pirical cross-product matrix for, 450; 
manifest data of, 448, 449 

list of items for: 15, 127-128, 225, 259, 
280-282, 292, 448; favorably biased, 
269; for intensity, 220, 259; unfavorably 
biased, 268-269 

of Negroes, for Negro vs. white inter- 
viewers, 720; and reliability, correlation 
analysis of, 297-299; scalogram of, 126, 
284, 285; scalogram analysis of, 123, 
128, 129, 252n, 280, 282; scalogram of 
intensity of, 224; and split-half lower 
bound to reliability coefficient, 309-311 

Attitude toward: postwar conditions, of 
Negroes, for Negro and white interview- 
ers, 720 

postwar conscription: 239; content 
scores vs. intensity scores, 241; inten- 
sity analysis of, 239, 241; list of items 
for, 135; scalogram of, 134; scalogram 
analysis of, 132-133 

postwar education, 583; postwar yo- 
cational guidance, 624 

Russia: 158, 170, 187-188; scalogram 
analysis of, 188 

socialism, item for, 459 

“sweating out a jump”: list of items, 
147; scalogram of, 146; scalogram anal- 
ysis of, 145 

U.8. participation in war, item for, 
292, 293; use of athletes and entertainers 
in the Army, 158; veteran preference for 
government jobs, 158 

the WAC: 239, 243, 253; content 
scores vs. intensity scores, 240, 244, 245, 
254, 255, 256, 257, 260, 261, 262, 263, 
264, 265; and fold-over technique, in- 
tensity analysis of, 252-253, 255; inten- 
sity analysis of, 239, 240, 252-253, 254; 
intensity analysis of for high, medium, 
and low generalized intensity, 43, 44, 
257, 260-261, 262, 263, 264, 265; list of 
items for, 131, 252; among noncoms, in- 
tensity analysis of, 243, 245; among 
privates, intensity analysis of, 243, 244; 
scalogram of, 130; scalogram analysis of, 
129, 131, 132; using both single and 
separate items, intensity analysis of, 
252-253, 256, 257 

the war: of Negroes, for Negro and 
white interviewers, 720; and neurosis, 
528 

Attitude universe, see “Universe of attri- 
butes” 

Attitudinal items and informational items, 
in scalogram analysis, 131, 182, 141-142, 
152, 170, 234 
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173, 380; as simple function of rank 
order, 69 
Average, see ‘Mean, arithmetic” 


B 


Background characteristics of soldiers: 
571-572, 587, 602, 639, 643, 645, 667; 
(July 1945 separatees), 618, 628, 657; 
(December 1945 separatees), 617, 628; 
list of items for, 695-098, 704; and post- 
separation farming, 644; and postsepa- 
ration return to previous employer, 636 

and postwar plans: for business own- 
ership, 655, 662-663; for education, 596, 
599-600, 601, 617-619, 621-622; for 
farming, 640, 642; to return to previous 
employer, 629 

and prediction, 569-570, 619, 682, 694 

Background items, distribution of re- 
sponses to for normals and neurotics, 
512-514 

“Backwash” in latent turnover, 452 

Bar chart, relation with scale, 68, 72, 73 

Barton, Allen, 413n 

Baseball in boyhood and neurosis, 522 

Basketball in boyhood and neurosis, 523 

Behavior, as necessary condition of defini- 
tion of attitude, 49 

Behavior systems and predictions, 573, 
574 

Bell-shaped trace lines, see ‘‘Middle item” 

Bending points of principal components: 
351-357; alternating signs of, 351-353; 
concavity of first point, 353; invariance 
of, 326, 327; meaning of, 326-327 

Bias: 48, 49, 59, 87, 214, 215, 266, 270, 
273, 275, 367, 481, 482, 601, 679, 711, 
714, 715, 717, 722; of generalized inten- 
sity, 261 

Bimodal response patterns, 440, 445, 468, 
469 

Binet, Alfred, 483 

Binomial distribution: and principal com- 
ponents, 325; standard error of, 279 

Biot, M. A., 359n 

Biseria] correlation coefficient: 199-200, 
and coefficient of reproducibility, 200; 
and scalogram analysis, 61, 63, 199, 200 

Bivariate normal distribution, see ‘Nor- 
mal bivariate distribution” 

Bivariate techniques, and multidimension- 
ality, 63, 200 

Body sway, to screen psychoneurotics, 
487n 

Body-contact sports in boyhood, and neu- 
rosis, 522-523 
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Bogardus, E. 8., 86 

Boundary conditions for difference equa- 
tions in principal components: for 
scores, 343; for weights, 344 

Bowen, H. R., 647n, 655n 

Boyhood fighting behavior, see ‘“‘Child- 
hood fighting behavior” 

Boyhood participation in sports, see ‘‘Par- 
ticipation in sports” 

British, attitude toward, see under ‘“‘Atti- 
tude toward” 

Brown, Warner, 300 

Bureau of Applied Social Research, vi 

Bureau of the Census, 722 

Bureau of Internal Revenue, 646 

Business, definition of, 646, 647, 659-660 

Business firms, number of: and number of 
employers, 646-647; small, decline in, 
655 
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“Cafeteria” items: 187-189; and equal- 
appearing intervals, 187; and scalogram 
analysis, 188 

Camp Stewart, Ga., 713, 714 

Cantril, Hadley, 4n, 40, 47n, 154n, 175n, 
187n, 214n, 216n 

Carnegie Corporation, vi 

Carter, Hugh, 186n 

Case history method of prediction, 575 

Categories: combination of: 16n, 74, 75, 
80, 100, 111, 152, 153, 169n; caution 
towarda, 79, 113, 117, 133, 283, 293; 
and content, 133, 595; example of, 112- 
113; and factor analysis, 209; to im- 
prove reproducibility, 111-115, 153, 210, 
293; and intensity analysis, 222; and 
latent structure theory, 209, 210, 365; 
and number of discriminable ranks, 113, 
133, 209, 210; as performed on one 
scalogram board, 121; to produce vary- 
ing marginal frequency distributions, 
114; on second scalogram board, 114; 
as Step 3 of scalogram analysis, 108-115 

interpretation of meaning of, 151; 
number of, and reliability coefficient, 
303, 306 

number permitted: in latent structure 
analysis, 365, 380; to rank items, 62; in 
scalogram analysis, 74, 78, 79, 80, 117, 
119, 152, 293 

problems of, 152-153, 187, 222 

Category types, frequency of, 334, 335 

“Causal” explanations in public opinion 
polling, 155, 473 

Censorship of mail, attitude toward, 158 
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Census of Business, see ‘‘U.S. Census of 
1940” 

Census of Population, see ‘U.S. Census 
of 1940” 

Centroid method, 498 

Certainty, degree of, 249, 616, 634, 653, 
660, 677 

Certificates of entitlement, 605 

Characteristic functions, 39n, 316n; see 
also ‘‘Principal components” 

Characteristic root, 320n 

Characteristic value, 320n 

Characteristic vectors, 39n, 316n; see also 
“Principal components” 

Charlier, C. V. L., 358 

Chave, E. J., 86n, 185n 

Chebyshev, see ‘““Tchebychef”’ 

Chicago psychiatric rejections and diag- 
noses, 476 

Childhood fears: 516-519; correlation of 
with criterion, 497; defined, 492, 516; 
discriminating value of, 479, 495, 516; 
list of items for, 516-519; and multiple 
regression weights, 498; and other areas, 
496, 497; as quasi scale, 493-494; scores 
for normals and neurotics, 516 

Childhood fighting behavior : 521-522; cor- 
relation of with criterion, 497; defined, 
492, 521; discriminating value of, 479, 
495, 521; list of items for, 521-522; and 
multiple regression weights, 498; and 
Other areas, 496, 497; as scale, 493; 
scores for normals and neurotics, 521 

Childhood health, and neurosis, 519 

Childhood neurotic symptoms: 519~520:; 
correlation of with criterion, 497; de- 
fined, 492, 519; discriminating value of, 
479, 495, 519; list of items for, 519-520; 
and multiple regression weights, 498; 
and other areas, 496, 497; as quasi scale, 
493-494; scores for normals and neu- 
rotics, 519 

Childhood participation in sports, 
“Participation in sports” 

Childhood relations with parents: 514- 
515; correlation of with criterion, 497; 
defined, 492, 514; discriminating value 
of, 479, 495, 514; list of items for, 514- 
515; and multiple regression weights, 
498; and other areas, 496, 497; as scale, 
493; scores for normals and neurotics, 
514 

Childhood school adjustment: 520-521; 
correlation of with criterion, 497; de- 
fined, 492, 520; discriminating value of, 
479, 495, 520; list of items for, 520-521; 
and multiple regression weights, 498; 
and other areas, 496, 497; as scale, 493; 
scores for normals and neurotics, 520 


INDEX 


Children, number of, and demobilization, 
189, 190 

Chi-square, to test goodness of fit, 429, 
457 

Church attendance, pre-Army, and neu- 
rosis, 526 

Circle, for scores and weights, for principal 
components: 36, 316, 317, 328; and cor- 
relation ratio, 316, 320; smallest pos- 
sible, 321 

Circularity of scores, 36, 319; see also 
“Principal components” 

City dwellers and neurosis, 513 

Civil Service, 674, 702 

Civil Service Commission, 675n 

Civilian and military life, and psychoso- 
matic complaints, 489, 502, 506-507, 
508-511 

Civilian support of the war effort, attitude 
toward, 158 

Clarity of postwar plans, 571, 635, 604 

Clark, K. E., 90 

Classification of: preinductees: 548, 550; 
for induction stations and service com- 
mands, 551-552 

psychiatric cases: 553; dominant tend- 
ency among induction stations, 550; for 
induction stations and service com- 
mands, 553-554; and NSA screening 
rate, 556-557 

respondents: 465-469; into broad 
groupe, 467; into latent classes, 468, 469; 
in latent dichotomy, 430; in latent dis- 
tance scale, 445-446; by preliminary 
score, 458; in restricted latent trichot- 
omy, 438-441 

Clausen, J. A., vii, 474, 568n, 587n, 623n 

Claustrophobia, and neurosis, 517 

Cleveland, Stuart, vii 

Closure component: 34, 203, defined, 313n; 
invariance of bending points for, 326; 
and regression weights, 361; shape of 
curve for, 323, 324, 325, 326, 356; shape 
of curve for weights, 329 

Coding of write-in questions, 585, 650, 679, 
682, 687 

Coefficient of improvement, 122; see also 
“Errors for improvement’’ 

Coefficient of reliability: average: 304- 
311; and coefficient of reproducibility 
as lower bound, 305-308; formula of for 
qualitative data, 304, 306; formula of 
for scale scores, 309; and marginal lower 
bound, 304-305; and number of cate- 
gories, 306; and single trial, 304; and 
Spearman split-half formula, 311; and 
split-half lower bound, 309-311; and 
upper bound, 307 

formula for qualitative data, 303; and 
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number of categories, 303; for psychoso- 
matic complaints scores, 503 

Coefficient. of reproducibility: 14, 64, 77, 
89, 100, 117, 218, 222, 280, 305, 457; 
and biserial correlation coefficient, 200; 
formula for, 77, 117, 457; lower accept- 
able limit of, 77, 117, 280, 287, 296; as 
a lower beund to coefficient of reliabil- 
ity, 305-308; and nonscalable area, 288— 
290, 293-294; reliability of, 278, 279- 
280; for single item, 287, 291, 307-308; 
standard error of, 77, 279-280; testing 
consistency of, 77, 78, 280, 282-283; and 
test-retest reliability, 306; for universe, 
77, 287 

Coefficient of variation, of psychiatric re- 
jection rates and NSA screening rates, 
558n 

Cohen, Jozef, 499n 

Cold sweat: and neurosis, 538; nonscalabil- 
ity of, 14; and other psychosomatic 
complaints, 543 

College: see also ‘Postwar plans for educa- 
tion’’; availability of, 604, 606-607, 620; 
and competition, 587; and financial ob- 
stacles, 587; and neurosis, 513; and post- 
war education, 600, 622 

College grades, prediction of, 692n 

Columbia University Bureau of Applied 
Social Research, vi 

Combat: amount of, and demobilization, 
189, 190; assignment to, and mild pay- 
choneurosis, 490n; attitude toward, 
which see 

desire for: and attitude toward ade- 

quacy of training, 150; and neurosis, 
531; and predictions of length of war, 
692n 


readiness for, see “Attitude toward 

combat”; worry about, and neurosis, 
530 

Combat fatigue, 491n 

Combat veterans, reporting fear symp- 
toms, 13, 142 ‘ 

Combination of categories, see under 
“Categories” 

Combination of scores and latent structure 
analysis, 467-468 

Communality: 204-205; for areas of psy- 
chiatric import, 498 

Companies, larger, rate of return to, 637 

Components, see “‘Latent structure theory 
and factor analysis,” ‘Principal com- 
ponents” 

Composite, to predict and to define atti- 
tude, 55, 56 

Computation techniques, of latent struc- 
ture analysis vs. scalogram analysis, 33 

Concave trace lines: 402-407, 432-434, 459 
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Conditional distribution, 361 

Conditional frequency funcvions, 206, 207 

Conditional order, 212 

Conditions of reducibility: 382, 385, 393- 
396, 397, 429, 441, 468; and constant K, 
417, 425, 435; for degenerate latent 
trichotomy, 396; for latent dichotomy, 
416-417, 421, 466; for latent distance 
scale, 411-412, 442, 443; for latent 
trichotomy, 398; for latent turnover, 
409-410; for restricted latent trichot- 
omy, 404-405, 407n, 433-435 

Confidence in one’s leaders, 163 

Congress, 477, 482 

Conscription, attitude toward, see “Atti- 
tude toward postwar conscription” 

Consistency of response, 60 

Constant component, see under ‘Principal 
components” 

Constants of proportionality for principal 
components, 38, 39, 320, 321, 328, 331, 
359 

Constant vector, see “Principal compo- 
nents and constant component” 

Content: and correlation analysis, 85n; 
evaluation of, 83, 84, 167; and intensity, 
method for plotting, 220-221, 226; item, 
effect. of being followed by intensity 
item, 252n; percentile, in intensity anal- 
ysis, 217, 220, 221, 227; and scalogram 
analysis, 85, 119, 129, 154, 313 

Content scores: see also ‘Score, as a value 
of a scale variable”; and generalized 
verbal intensity, 265; and intensity 
scores, 216-218, 219-221, 222, 223, 243, 
270; and nonscalable areas, 221 

Context, influence of for: postwar plans 
studies, 584, 585, 591, 593, 597, 652, 677, 
682; psychosomatic complaints scores, 
503-504 

Contingencies: and education, 588-597, 
619; importance of, 579, 580, 582, 596, 
665, 693, 695; and unidimensionality, 
685-686 

Contingency items: 481, 590-592, 683- 
684, 687; and context, 591; and educa- 
tional level, 594; examples of, 588-589; 
as more or less favorable, 683-684, 685 

Continuum, latent attitude, see “Latent 
attitude continuum” 

Continuum, multivariate, see ‘Multiple 
dimensions” 

Continuum, single, see “‘Unidimensional- 
ity’”’ 

Continuum, vertical, 69 

Convex trace lines, 402-407, 432-434, 459 

Convulsions, as a critical sign, 511n 

Coombs, C. H., 18n, 446 


Cornell trial-scoring and graphic technique 
of scalogram analysis, 91, 143, 144 
Correlation: with criterion, need for bat- 
tery of predictors, 58; as criterion for 
content, 85n, 129 
between items: in latent structure 
theory, 20, 24, 28, 367 (see also ‘“Homo- 
geneous dichotomous system’’); of the 
same type of content, 85 
linear, between principal components: 
39, 204, 329-330; for nonscalable items, 
39 
maximum possible, for given marginal 
frequencies, 297-299 
Correlation coefficients, see under separate 
headings 
Correlation matrix and cross-product ms- 
trices, 394 
Correlation ratio: 336-340; and constant 
component, 322, 340; definition of, 35n, 
194, 314, 337-338; and difference equa- 
tions for scores, 344-347; and difference 
equations for weights, 347-349; maxi- 
mization of, 338, 339; and oscillations, 
353-357; of scores, 337-338; for scores 
or weights, 35, 36, 315, 316, 320, 328, 
340; and “size of circle,” 316, 320-821; 
and standard deviations of components, 
324n, 331; sum of squares of, 39, 322 
325n, 351; of weights, 338-339 
Cost of: coding write-in items, 481; inter- 
views, 582; psychiatric examinations, 
487; scalogram analysis, 121 
Couper, W. J., 623n 
Criteria: for combination of categories, in 
scalogram analysis, 111, 112, 113; for 
construction of an “unbiased” item, 
214; of latent linearity, 446 
of latent structure analysis: 19, 20, 
24; see also ‘Latent structure theory, 
basio postulate of”; approximation to, 
20, 29 (see ‘“‘Fit’’); need for standardiss- 
tion of approximation to, 31, 33 
for ranking respondents, in scalogram 
analysis, 106, 108, 115 
of scalogram analysis: 9, 10, 14, 15, 
17, 39, 40, 64, 78, 158, 287-288, 291, 
293, 300; criticism of, 10; failure to ful- 
fill, 122, 291, 293; outlined, 77-80, 117, 
119, 287-288 
Criterion: as something to be predicted, 
.173, 174, 175, 178, 182, 474, 483, 568, 
575; single, 55 
Criterion dilemma, see “Criterion situs- 
tion” 
Criterion groups: 55; for psychiatric 
screening tests, 490, 505, 507 
Criterion situation: 483-484, 682, 690- 
693; ambiguities of, 474, 477; of psy- 
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chiatric screening test, 490-492, 502, 
539 

Criterion-exclusive screening, 487-488 

Criterion-inclusive screening: 487-489; 
applicability of, 488-489 

Critical ratio: 61, 63; and intelligence item 
analysis, 183n 

Critical scores: 494n, 499, 501n, 503, 506, 
508, 510, 547, 574; and anonymity, 504; 
and critical signs, 511 

Critical signs: 511-512; defined, 511; list 
of items for, 511n; and psychiatric ex- 
amination, 548-549; and screening, 511, 
555, 565, 566; to supplement NSA 
scores, 561-567 

Cross products: first order: 33, 372, 374, 
392, 394, 401n; adjusted (fitted), 421; 
as correlation coefficients, 470; for de- 
generate latent trichotomy, 396; and 
higher order cross products, 398; as an 
integer (unstandardized), 392-412, 420; 
for latent dichotomy, 392, 399; for la- 
tent distance scale, 411; for latent tri- 
chotomy, 396, 397; for latent turnover, 
408-410, 450-451; and links, 405; as a 
proportion (standardized), 374n, 419, 
433, 450; for restricted latent trichot- 
omy, 404 

higher order: computation of, 423- 

424; construction of, 398; for latent 
dichotomy, 400; for latent trichotomy, 
ra for restricted latent trichotomy, 


Cross-product matrix: 393-394; actual and 
fitted, 421-422 
empirical, for: latent dichotomy, 419, 
421; latent turnover, 450; restricted 
latent trichotomy, 433 
and factor analysis, 470; fitted, for 
latent dichotomy, 421; for Guttman 
scale, 460; logarithms of empirical, for 
latent dichotomy, 420; rank of, 394, 455; 
and requirement for positive signs, 415 
schematic, for: latent turnover, 409; 
restricted latent trichotomy, 404 
for Thurstone scale, 460 
Cross section of Army, in psychiatric test 
pretest: 489; correspondence of to in- 
ductees, 602, 505-511; distribution of 
background replies of, 512-514; distri- 
bution of responses of to items of psy- 
chiatric import, 514-538; as ‘“‘effectives,” 
490; and psychoneurotic patients, 492, 
493, 494-497, 499-500, 538n; psychoso- 
matic complaints scores of, 501, 547; 
selection of, 491, 512; size of sample of, 
495, 512 
Crutchfield, R. 8., 4n 
Cultural influences toward scalability, 142 
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Cumulative items, 10-17, 22n, 31, 135, 
141, 460 

Current events, knowledge of, see ‘“Knowl- 
edge of current events” 

Cutting point: to divide respondents into 
pro and con: 42, 46, 47, 48, 49, 59, 213, 
214-215, 216, 249, 275, 482; and item 
wording, 215, 275 

of multiple check lists, for latent 
structure analysis, 21, 22n, 209, 210, 
469; of NSA test, to include psychiatric 
cases, 475, 511, 555, 565 

on psychosomatic complaints scores, 
to separate critical from noncritical 
scores: 501, 510-511; for civilian vs. 
military life, 502-503, 506, 508-511; re- 
liability of, 503 

Cutting points: to divide response patterns 
into broad groups, 27; of multiple check 
lists for scalogram analysis, 15, 16; for 
prediction purposes, 580, 595, 612, 614, 
627, 633, 670, 682, 687 

in vertical continuum of perfect Gutt- 
man scale, resulting from each dichot- 
omy: 69, 110, 114, 115, 162, 331, 333; 
and intensity analysis, 219; relation to 
“solid streaks,’ 110; as separating re- 
sponse categories of distinctly different 
meaning, 110, 111 
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Dark vision, to screen psychoneurotics, 
487n 

Dates with girls and neurosis, 524-525 

Day’s work for the Army, 490, 491n 

Deciles and attitude research, 296 

‘Definite’ postwar plans for education, 
defined, 599 

Degenerate latent structures: fully, 395- 
396, 401n, 403n, 470; partially, 395, 401, 
432-441 

Degrees of freedom, for comparison of ac- 
tual and fitted response pattern frequen- 
cies, 429 

Demobilization, point system for, see 
“Army Score Card Plan” 

Demobilization period: 601; and business 
ownership, 657, 658; and Dec. 1945 
separatees, 628, 661; predictions of un- 
employment in, 571, 606; and programs 
of agencies, 578 

Demographic data on Army: (March 
1945), 716; existence of, 717; need for, 
710 

Department of Agriculture, 477, 569 

Department of Commerce, 569 
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Depression (economic): optimism toward, 
693; worry about, and neurosis, 531 
Depression (personal) and neurosis, 534 
Destructiveness and neurosis, 532 
Detection and prediction, 489 
Determinants: basic, formed from M-mat- 
rices in latent structure theory: 389- 
394; of order higher than A, 389-390, 
393; of order A, 390-392, 393; of order 
less than A, 392, 393 
first order, see “‘Cross products” 
Detroit psychiatric rejections and diag- 
noses, 476 
Diagonal matrix, 387-388 
Dichotomization: biased, 267-269, 270, 
274; into grossly unequal groups, effect 
of upon reproducibility, 78, 287, 290- 
291 
of items in psychiatric survey, con- 
stancy of: for scale pretest and final pre- 
test, 494; for pretest, 512 
of items, as required in latent struc- 
ture analysis: 365, 380; and shape of 
trace lines, 432 
of population by use of single item, 
123, 151; into pro and con, see “Cutting 
point, to divide respondents into pro 
and con’”’ 
of scores, to maximize difference. be- 
tween two groups: 494, 495; examples 
of, 501, 514, 516, 519, 521, 522, 523, 
526, 527, 531, 533; examples of non- 
maximization, 520, 525, 529, 530 
varying, to produce all latent cate- 
gories, 210 
Dichotomous attributes, 67 
Dichotomous items, independent: example 
of, 288-291; scalogram analysis of, 289, 
290 
Dichotomous latent structure, see ‘“Latent 
dichotomy analysis” 
Dichotomous system: 379-384; homogene- 
ous, which see 
Dichotomous weights, for psychosomatic 
complaints score, 499-500, 501, 547 
Difference equations in principal compo- 
nents: and boundary conditions, 343, 
344; and classical orthogonal! functions, 
357-359; and correlation ratio, (for 
scores) 344-347, (for weights) 347-349; 
first-order, 341-342; and oscillations of 
latent vectors, 351-353; for scores, 341, 
342, 343; second-order, 342-344; for 
weights, 342, 343, 344 
Difficulty of choice, 249 
Dimensionality: 47, 48, 49, 83, 88; see also 
“Multiple dimensions” and ‘‘Unidimen- 
sionality”’; changes in, 82, 83 
Disagreeableness and neurosis, 532, 533 
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Dischargees, and prediction, 576-577 

Discipline in the home, and neurosis, 515, 
518 

Discrimina! error, 12n 

Discriminant function: 178, 182; and mul- 
tiple regression, 495n 

Discrimination, institutionalized, scalabil- 
ity of, 157n 

Discrimination between: psychoneurotics 
and cross section, as maximized, 494, 
495, 498n, 538n; probable returnees to 
previous employer and others, 627 

Discriminating value of: areas of psychi- 
atric problems: 479, 495, 501, 514-533; 
for all 15 areas and for psychosomatic 
complaints (or for selected eight areas), 
499; method of ascertaining, 494; and 
multiple regression weights, 498 

NSA, 477, 558-561 
psychosomatic complaints score: 495, 

499; chart of, 501; and detection of psy- 
choneurotics, 502; and varying weight- 
ing, 499-501, 547 

Distribution function of respondents: 369, 
370, 376, 377, 379, 305, 464, 466, 468; 
discrete, see ‘‘Point distribution” ; hypo- 
thetical, 466; moments of, which see; 
standardized, for linear trace lines, 375, 
461-462, 464, 465-466 

Distribution of responses, univariate vs. 
multivariate, in scalogram analysis, 68, 
71 

Distribution surface, 453 

Divorce of parents and neurosis, 514 

Dizziness and neurosis: 536; and other 
psychosomatic complaints, 541 

Dodd, S. C., 86n 

Dollard, John, 483, 485, 689, 690, 691, 692 

“Don’t know’ responses, treatment of in 
latent structure analysis, 402-403, 482- 
433, 469, 472 

Doolittle, M. H., 498, 499, 538 

Double-barreled items, 152 

Dreams of a postwar career, 571, 579, 585, 
645-646, 648, 653, 665, 668, 677, 692- 
693, 694 

“Drop,” see “Link” 

Dropper for insertion of balls in scalogram 
board, 96 

Dubin, Robert, 710 

Dummy variable, as additive constant, 
539, 544, 545 

d-values, for latent distance scale, defined: 
412, 443, 444 

E 


Economic conditions in demobilisa.ion pe- 
riod, 571, 577, 601, 606, 632, 633, 638 
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Education, postwar: see also ‘Postwar 
plans for education’’; number of veter- 
ans enrolled, 606, 622n; per cent of stu- 
dents full-time, 606 

Educational achievement and neurosis, 
521 

Educational bias in mail responses, 587, 


Educational level: and consistency of re- 
sponses, 594-595; and free comments, 
721-722; and length of war, in education 
plans (summer 1943), 594; and neurosis, 
513; and nonresponse on surveys, 719; 
and postseparation business ownership, 
662; and postseparation return to previ- 
ous employer, 636; and postwar plans 
for business ownership, 655; and psychi- 
atric rates, 561; and success of predic- 
tion, 694; and write-in items, 10 

Educational psychology, 61, 194 

Edwards, A. L., 90, 167n 

“Effective,” definition of, 490n—491n 

Ego-involvement and prediction, 692 

Eigen functions, 39n, 316n; see “Principal 
components” 

Eigenvalue, 320n 

Eigen vectors, 39n, 316n; see “Principal 
components” 

Emancipation from parents: 523-525; cor- 
relation of with criterion, 497; defined, 
493, 523; discriminating value of, 479, 
495, 523; list of items for, 524-525; and 
multiple regression weights, 498; and 
other areas, 496, 497; as scale, 493; 
scores for normals and neurotics, 523 

“Emotionally immature” boys, 558n 

Employer, type of, and rate of return to, 
637-638 

Employers, number of, and number of 
business firms, 646-647 

Employment, pre-Army: 571-572, 623, 
625, 628; duration of and plans to re- 
turn, 629, 638; duration of and post- 
separation return, 636, 638; and neu- 
rosis, 526 

Engagement, adjustment in, 176n 

England, see “Attitude toward the Brit- 
ish” 

Entertainers, attitude toward, 158 

Enuresis: as a critical sign, 511n; and neu- 
rosis, 520 

Equal-appearing intervals: and latent 
structure theory, 31, 32, 447, 469; and 
paired comparisons, 187; and reproduci- 
bility of responses, 186, 187; and scalo- 
gram analysis, 185-187, 459; utility of, 
186-187 

Equations: accounting, for latent struc- 
ture analysis, see “Accounting equa- 
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tions” ; of internal consistency, 339-340; 
of scale analysis, see ‘‘Scalogram anal- 
ysis, equations of” 
Error: in intensity analysis, 218-219, 253, 
257, 261, 276, 301-302 
in scalogram analysis: 77-80, 100, 
105, 110, 111, 160, 161, 169, 201, 279, 
287, 294, 307, 311, 361, 595, 612; non- 
random, 129, 280 (see also ‘‘Nonscale 
types’’); and number of categories, 294— 
295 
Error patterns, of scale, nonscale, and 
quasi scale items, 160-163 
Error variance: average for population, 
309; of person’s scores, 308-309 
Errors: close to center columns, in scalo- 
gram analysis, 108 
for improvement: combined, 109, 113; 
as criterion for combination, 113; as 
criterion for rejection of scalability 
(examples), 288-291, 293-294; and 
errors of reproducibility, 117, 288; un- 
combined, 109, 111, 112, 113 
of intensity measurement, 261 
of observation: 302, 307; and repro- 
ducibility, 311 
of reproducibility: 117, 152, 159, 160, 
182, 280; and errors for improvement, 
117, 288; and errors of reliability, 305; 
and random errors of measurement, 
299-300 
of unreliability, 302, 307 
Esprit de corps, definition of, as example 
of known-group validation, 54 
eta-values (n): for latent distance scale, 
412; and links, 405; for restricted latent 
trichotomy, 405, 435 
Ethnocentrism, and latent structure the- 
ory: 364-368, 369, 370; examples of 
items, 364 
Expected value score (EVS), for ordering 
response patterns: 462, 464-465, 466, 
468; and point distributions, 469 
Expert opinion, 214, 215 
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Factor analysis: 4, 9, 18, 29, 197, 201, 204, 
209, 498; and communalities, 204-205, 
498; and latent structure theory, 6, 7, 
19, 29, 205-210, 384, 422n, 469-472; 
and linear relationships, 470-471; and 
point intercorrelations, 201-203, 470; 
and principal components, 313, 330- 
331; and qualitative variables, 191, 192, 
200, 204, 469-470; and quantitative 
variables, 191, 192, 197, 201, 204, 472; 
and scalogram analysis, 45, 163, 191- 
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212; and tetrachoric coefficients, 200- 
201, 470; and zero-order correlations, 
471 

Factor loadings: and latent marginals, 
471; for areas of psychiatric import, 498 

Fainting: and neurosis, 536; and other 
psychosomatic complaints, 540 

False negatives and positives in screening, 
489, 501 

Family, influence of, and neurosis, 513, 
614-515, 518, 524 

Family composition, distribution for nor- 
mals and neurotics, 513, 514 

Farming: see also ‘Postwar plans for 
farming’’; availability of, 642; as life- 
work, 641, 644; and neurosis, 513 

Favorite child of parents and neurosis, 
514-515 

Fear in childhood, see ‘‘Childhood fears” 

in combat: 13, 14, 142; and fear 

among paratroopers, 145 

Fear among paratroopers, 145 

Fear symptoms: list of items for, 141; 
scalogram of, 140; scalogram analysis 
of, 142 

Federal employment, number of veterans 
in, 675 

Ferguson, G. A., 470n 

Festinger, Leon, 90 

Fights in childhood and neurosis, 521-522 

First component, see “Metric component” 

Fishing in boyhood and neurosis, 523 

Fit, in latent structure analysis: of ad- 
justed to empirical cross-product mat- 
rices, 33, 421-422 

of adjusted to observed frequencies: 

23, 414, 428-429, 441, 457, 467; and 
precision, 431; and reproducibility, 456; 


total misfit (for latent dichotomy ex- - 


ample) 23, (for latent distance scale 
example) 457 

Fitted frequencies, see ‘Response pat- 
terns, fitted’”’ 

Fitted response patterns, see under “Re- 
sponse patterns” 

Fixed point of reference, see ‘Zero point’”’ 

Florant, Lyonel C., 568n 

Foldover of scale rank order, for intensity 
component, 37, 321 

Fold-over technique, to derive intensity 
and content scales, 40, 41, 249, 252-253, 
276 

Follow-up studies: of postwar plans: 478, 
568, 577, 581, 586-587, 603-622, 630- 
639, 643-644, 658-666, 672-675; list of 
items for, 706-708; purpose of, 660n 

on psychoneurotic complaints, 506 
Football in boyhood and neurosis, 523 
Football scores, predictions of, 692n 
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Ford, R. N., 86, 587n 

Foreign policy, postwar, attitude toward, 
158 

Form 20 card, 711, 714 

Free comments: and educational level and 
grievances, 721-722; and interviewers, 
719 

Frequencies: fitted, see ‘Response pat- 
terns, fitted”; joint positive, which see; 
manifest, see ‘Marginal frequencies’’; 
marginal, which see; missing, which see; 
ultimate, see ‘Response patterns, fitted” 

Frequency distribution, multivariate, as a 
scale, 64, 88, 89, 91, 93 

Frequency function and content of inter- 
est, 193 

Friendships and neurosis: 527 

F-test of variability between and within 
regions of the U.S., for: NSA scores, 
555n; psychiatric diagnoses, 552n 

Function: simple, 64; single-valued, 63 

“Funnel-shaped interview,” 677 


G 


Gauging Public Opinion, 40 

Gaylord, R. H., 4700 

Generalized verbal intensity: 42, 43, 44, 
253, 256-257, 260-261, 264; and atti- 
tude toward the Army, 262-265 

and attitude toward the WAC: con- 
tent scores vs. intensity scores, 260, 261, 
262, 263, 264, 265; intensity analysis of, 
257, 260-261, 262, 263, 264, 265 
and content scores, 265; and item 

marginals, 263-264, 265; list of items 
for, 259; need for control of, 43; quasi 
scale of, 43, 256; scalogram of, 258; scalo- 
gram analysis of, 256-257; and the 
single item, 262-266 

Geometric mean, for fitted 8-values, 419- 
420 

GI bill: 477, 482, 569, 571, 588, 599, 600, 
606, 617, 642, 645, 647, 656, 660, 691, 
704, 708; amendment to, 606, 620; items 
on, 704, 708; knowledge of educational 
provisions of, 600, 601 

Golf in boyhood and neurosis, 523 

Goodenough, W. H., 90, 91n 

Goodness of fit of latent structure, 23n, 
429 

Gottlieb, M. J., 357n, 358 

Government, understanding of, 668 

Government jobs, veteran preference for, 
attitude toward, 158 

Governmental employment, error in self- 
reporting of, 637n, 667-668 

Grade school and neurosis; 513 
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Gradient errors, 79, 159, 160, 161, 162, 
163, 207, 212 

Gramian matrix, 197, 200, 201, 338, 339, 
346 ; 

Greenwood, Ernest, 155n 

Gregariousness and neurosis, 527 

Group factor, 367, 422 

Group reliability coefficient, see “‘Coeffi- 
cient of reliability, average”’ 

Group Rorschach test, 479 

Guessing answers, 67n 

Guilford, J. P., 4n, 184n, 191n, 194n 

Guttman, Louis, vi, 3n, 4, 5, 6, 7, 9, 18, 
$4, 39, 40, 45, 46n, 56n, 60n, 90, 91n, 
93n, 172n, 178n, 182n, 189n, 194n, 199n, 
277n, 302n, 308n, 312n, 315n, 336n, 362, 
455, 456, 457, 458, 459, 460, 468, 486n, 
498, 499n, 538n, 568n, 710 

Guttman scale, see ‘“‘Scalogram” 


H 


Halo effect, 167-168 

Harris, H. I., 486n 

Harrower-Erickson multiple choice Ror- 
schach test, 479n 

Harvard Laboratory of Social Relations, 
vi, 12, 138n 

Hausknecht, George, 710 

Headaches: and neurosis, 536, 537; and 
other psychosomatic complaints, 541, 
542 

Health: (in childhood) and neurosis, 519; 
and deferment and neurosis, 529; prob- 
lems and neurosis, 535; problems and 
other psychosomatic complaints, 540 

Heartbeat, rapid: and neurosis, 536; and 
other psychosomatic complaints, 541 

Heredity and neurosis, 514 

Hermite polynomials, 357, 358 

Heterosexual relationships and neurosis, 
524-525 

High school: see also “Postwar plans for 
education’’; return to, 600, 622 

High school graduates: see also “‘Educa- 
tional level’; and neurosis, 513; and 
postseparation performance, 636-637; 
(privates) and satisfaction with Army 
job, 243, 248 

Homogeneous dichotomous system, 382, 
383, 393, 417, 422, 427, 438, 453, 470 

Homosexuals, 489n 

“Hookey”’ and neurosis, 521 

Horst, Paul, 4n, 56n, 90, 91n, 178n, 315n, 
538n, 575n 

Hotelling, Harold, 313n 

Hunt, W. A., 486n 

Hunting in boyhood and neurosis, 523 
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Ideal types, and prediction, 573, 574 

Identification with the war effort: 527- 
528; correlation of with criterion, 497; 
defined, 493, 527; discriminating value 
of, 479, 495, 527; list of items for, 528; 
and multiple regression weights, 498; of: 
Negroes, for Negro and white interview- 
ers, 720; and other areas, 496, 497; as 
scale, 493; scores for normals and neu- 
rotics, 527 

Tlliterates: and farming, 643; rate among 
white troops, 643; and sampling, 714 

Imprecision, see ‘‘Precision” 

Ince, E. L., 357n 

Inconsistency of postwar plans for: busi- 
ness ownership, 649, 653; education, 
592-595; education and employment, 
481, 679, 680; farming, 640, 645; gov- 
ernment employment, 669, 672; return 
to previous employer, 625 

Independence, statistical, definition of, 
288; see also ‘“‘“Homogeneous dichoto- 
mous system” 

Index: arbitrary, use of: 174-181; for pre- 
diction purposes, 55-57, 178; for repro- 
ducing responses, 176 

in nonscalable areas: and improper 
weights for prediction, 178-179; and 
lack of criterion for weights, 176-177; 
lack of descriptive meaning for, 175- 
176; and sampling of items, (and de- 
scription) 177-178, (and prediction) 
179-181; and underestimation of pre- 
dictability, 178-179, 180 

for ordering response patterns, 440, 
446, 462, 464, 465, 466, 468 

Indifference, point of, see ‘Zero point” 

Inductees: and NSA scores and critical 
signs, 566 

screened by NSA: 476, 508, 510, 558, 
565; for induction stations and service 
commands, 556-557 

Induction rate: 550; for induction stations 
and service commands, 551-552, 560; 
for NSA scores, 562; for NSA scores and 
critical signs, 563 

Induction stations: and Army situation, 
489, 502, 505-506, 508, 510; Chicago vs. 
Detroit, 476, 477; and disposition of 
preinduction examinees, 551-552, 560; 
list of, 551-552; and NSA scores, 548— 
567; and NSA screening rate for rejects 
and inductees, 556-557, 559, 560; and 
pretest, 508-511, 549; and psychiatric 
diagnoses of psychiatric rejects, 553- 
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554, 559; and psychiatrists, which see; 
and screening tests, 488, 502-512 
Industry vs. agriculture, 642 
Inefficiency: as a criterion, 490, 507; and 
psychiatric diagnosis, 490n 
Information tests, as quasi scales, 162 
Informational items: and attitudinal 
items, in scalogram analysis, 131, 132, 
141-142, 152, 170, 234; and intensity 
analysis, 233-234; as scales, 65-72, 141 
Insomnia: and neurosis, 535; and other 
psychosomatic complaints, 540 
Institute of Mathematical Statistics, 483n, 
689n 
“Tntelligence’’ and neurosis, 521 
Intelligence measurement: 61, 181n, 183n, 
191, 205; and principal components, 313 
Intensity: of attitude, 33, 40, 48, 215, 
216n, 218, 219, 222, 249, 276; and con- 
tent, 41, 219-276, 301-302, 324, 615- 
617; generalized, see ‘Generalized verbal 
intensity”; intrinsic, 253; plotting of, 
vs. content, 220-221, 226; of postwar 
plans, 616, 634, 660, 684 
Intensity analysis: 18, 43, 219-276; see also 
“Intensity component”; basis of, 216, 
275; and content, 42, 236, 243 
content scores vs. intensity scores of 
attitude toward: administration of the 
Army Score Card Plan, 237; the Army, 
230, 231, (with biased items) 271, 273, 
(and expectation of fulfillment of Army 
Score Card Plan) 250, 251; idea of the 
Army Score Card Plan, 238; officers, 
223, 227, (with biased items) 272, 274; 
postwar conscription, 241; the WAC, 
240, 254, (using fold-over technique) 
255, (high generalized intensity) 260, 
263, (low generalized intensity) 262, 
265, (medium generalized intensity) 
261, 264, (among noncoms) 245, (among 
privates) 244, (using both single and 
separate intensity items) 256, 257 
content scores vs. intensity scores of: 
knowledge of current events, 235; satis- 
faction with one’s Army job, 242, 
(among educated privates) 248, (among 
noncoms) 247, (among privates) 246 
fold-over technique vs. separate items, 
249, 252-253; illustrations of, 221-239, 
240-242; and information tests, 233- 
234; and number of items, 301; and pre- 
test, 302; and the problem of error, 218— 
219, 226-227, 253, 257, 276; procedure 
for, 249; relativity of, 239, 243-249, 
250-251 
Intensity component: 5, 34, 35, 40-44, 46, 
156, 157, 203, 213-276, 323, 324; expo- 
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sition of; 37, 38; and regression weights, 
361 

shape of curve: 34, 41, 216, 217, 219, 
221, 222, 223, 227, 249, 270, 275, 276, 
287n, 324, 326, 356; invariance of, 276, 
301, 326; and nonscalable areas, 333; 
and sampling of people, 301-302; for 
weights, 329 

shape of curve, empirical examples of: 
flat-bottomed, J, 238, 273, 274; fist- 
bottomed, skewed, U, 227, 231, 246, 
248, 273, 274; flat-bottomed, symmetri- 
cal, 237, 242, 247, 250; sharp, J, 235; 
sharp, skewed, V, 240, 241, 244, 245, 
251, 254, 255, 257, 263, 264, 265 

Intensity component weights, 328 

Intensity curve, ideal: example of, 216- 
218; sampling error of, 218-220, 222, 223 

Intensity function: 215-216, 217, 219, 221, 
222, 313; see also “Intensity analysis,” 
“Intensity component”; reliability of, 
301-312; validity of, 249 

Intensity median for each content rank, 
220, 223, 270 

Intensity percentile, 217, 220, 221, 226 

Intensity scales as quasi scales, 40, 41, 162, 
218, 222, 231, 234, 249, 301 

Intensity scores: 216, 217, 219, 221; and 
content scores, 216-218, 219-221, 222, 
ae! 243, 270; and fold-over scores, 252, 

5: 

Interest in postwar education, see ‘Post- 
war plans for education,”’ ‘“‘Postsepara- 
tion interest in education” 

oad consistency, equations of, 339 

40 

Internal consistency of respondents, and 
judges, 11, 12 

Internationalism, attitude toward, 158 

Interpretation, problems of, 151-152, 154- 
155 

Interrelations between items: 367; see also 
“Correlation between items,’’ ‘“Home- 
geneous dichotomous system”’ 

Interval scales, 11, 12 

Interventionist sentiment, 214n 

Interviews: advantages of in sifting con- 
tingencies, 581-583, 654, 674; and com- 
bination of categories, 595; of illiterates, 
714; methods of, v, 722; of Negro illiter- 
ates, 643; of Negro soldiers, for Negro 
and white interviewers, 53, 720, 721; 
with nonrespondents, 586; and plans 
classification, 653-654; and question- 
naires, 581-583, 676, 718-721; and read- 
ing items aloud, 719; repeated, s¢ 
“Test-retest reliability,” “Latent turt- 
over” 

Introduction to the Theory of Statistics, 6 
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Inverse matrix, 347, 348, 360 

Irritability and neurosis, 532, 533 

Israel Institute of Public Opinion Re- 
search, 313n 

Italicized frequency in content vs. inten- 
sity tables, defined, 223 

Item analysis: 8, 181, 183; and discrimina- 
tion, 184; and internal consistency, 182— 
185; and latent structure theory, 26n, 
29; objection to, 8, 181, 183, 185; and 
prediction, 181-182, 185; and reproduci- 
bility, 184; and scalogram analysis, 181, 
184 

Item marginals, see ‘“‘Marginal frequen- 

+ ” 

Item types: defined, 317n; distribution of, 
327; number of, and number of principal 
components, 322, 325; and types of re- 
spondents, 334 

Items: characterizing, 167; concave or 
convex, see ‘Parabolic trace lines’; as 
constructed by research worker, 286, 
363, 366; cumulative, which see; as de- 
fined by Lazarsfeld, 363; direct check- 
list, 680-681, 687; endorsed, and re- 
spondent’s rank, 9, 14 

evaluative, in postwar plans studies: 
585, 591, 682; and business ownership, 
660, 682; and education, 614-617, 682; 
example of, 590; predictive value of, 
682-683; and return to previous em- 
ployer, 633-635, 682; and scaling, 684 

favorably biased, 42, 266, 267-268, 
269, 270, 273-274, 275; free answer, see 
“Items, write-in’; independence from 
scale scores, 305, 306, 307; independent, 
see “Homogeneous dichotomous sys- 
tem”; for intensity analysis, form rec- 
ommended, 41, 253; for intensity scales, 
40, 41, 249, 252-253; need for single 
meaning, 60, 151, 185 

Items, number necessary: for intensity 

analysis, 219, 301 
for latent structure analysis: 27, 468; 
and degree of precision, 441; effect of 
increasing number of, 25, 27, 401n, 431, 
441, 468 
for scalogram analysis: 8, 17, 18, 79, 
80, 117, 118, 179; criticism of, 17; if 
dichotomized, 79, 117; effect of increas- 
ing number of, 81, 156, 179, 299-300, 
301; in final study, 300-301; and num- 
ber of categories, 79, 293; and number 
of ranks desired, 79, 296, 300, 301; in 
pretest, 17, 79, 119, 283, 300, 301; in 
quasi scale, 162, 163; and reproducibil- 
ity as a lower bound to reliability, 307 
Items: open-ended, see ‘Items, write-in” 
for prediction, types of: 480-482, 585, 
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687; comparison between, 612-617, 678— 
683, 687-688 

presenting alternatives: 596-597, 648, 
681-683, 687; and direct question, 612— 
614, 665, 686; example of, 597-598; and 
write-in questions, 651-653 

relationship between in latent struc- 
ture analysis, 6, 19, 367, 369 (see also 
“Homogeneous dichotomous system’’); 
sampling of, which see; for scalogram 
analysis, form recommended, 459-460; 
selection of, 150, 151, 163, 182, 185-186, 
363, 366; as simple functions of scores, in 
Guttman scale, 66, 69; and smaller sub- 
sets of items, 17, 18, 79, 81, 151; un- 
favorably biased, 42, 266, 267, 268-269, 
270, 273, 274 

write-in: 480-481, 585, 648, 653, 677, 
678-680, 687; and coding, 585, 650; and 
direct questions, 584, 644, 665, 687-688; 
and evaluative items, 660; examples of, 
679; and farming, 640; and items pre- 
senting alternatives, 651-653 

wording of, which see 

Iterative method, for deriving weights, 

538-539, 546-547 


J 


J curve, see “Intensity component, shape 
of curve” 

Jaffe, A. J., 709n, 710 

Jim Crow regulations, 722 

Job satisfaction, in Army, see ‘‘Satisfac- 
tion with one’s Army job” 

Joint Committee of the National Research 
Council and the Social Science Research 
Council, on the Measurement of Opin- 
ions, Attitudes, and Consumer Wants, 
722 

Joint occurrence matrix, see ‘Matrix of 
joint positive frequencies” 

Joint occurrences: as basis of scale anal- 
ysis, 67; and latent structure theory, see 
‘Joint positive frequencies’ 

Joint positive frequencies, in latent struc- 
ture theory: 370, 371-372, 381-382, 384, 
398, 418; and fitted frequencies, 422; 
mathematical properties of, 381-396; 
matrix of, which see; number of, 381; 
order of, 381; and response patterns, 
381, 418, 429; and trace lines, 372-375, 
459-460 

Judges, utility of, for: evaluating content, 
84, 167; ranking items, 11, 185-186; 
rating ‘‘cafeteria” categories, 187n; se- 
lecting characterizing items, 167 
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Karman, T. V., 359n 

Katz, D. C., 40, 216n 

Kendall, M. J., 381n, 382n 

Kendall, Patricia L., vn, 467n 

Kilpatrick, F. P., 90, 167n 

Knowledge of current events: content 
scores vs. intensity scores, 235; inten- 
sity analysis of, 233-234, 235; list of 
intensity items for, 233; list of items for, 
139, 233; scalogram of, 138; scalogram 
analysis of, 141, 142; scalogram of inten- 
sity of, 232 

Knowledge as nonnegative, 233 

Known-group validation: 54; and inability 
to define, 54, 55 

Krawtchouk (Kravchuk) polynomials, 357 

Krech, David, 4n 

Kriedt, P. H., 90 

Kronecker, Leopold, 360 

Kuder, G. F., 503 

Kuder-Richardson formulas, 503 

Kurtosis of distribution function for linear 
trace lines, 466 

K-value: computation of, 423-425, 434— 
436; and conditions of reducibility, 417, 
425, 435; constancy of for given triplet 
of items, 415, 417, 424-425, 434; for 
latent dichotomy, (3 items) 400, 415, 
423, 448, (4 items) 425; for linear trace 
lines, 375, 464, 465; for restricted latent 
trichotomy, 407, 434, 436; and sampling 
of triplets, 425; and t-values, 426 


L 


Laboratory of Social Relations, vi, 12, 13n 

Laguerre polynomials, 357, 358 

Lambda (A), as number of latent classes, 
382, 385, 390 

Latent attitude continuum: 6, 12, 32, 363, 
364, 366, 367, 370, 380, 402, 408, 432, 
458, 459, 461, 407; utility of, 7 

Latent attribute: 19, 20, 22-29, 414; pro- 
portion possessing: 23, 25, 429; con- 
stancy of, 452; convergence of as further 
items are added, 25, 27, 431 

Latent category, 206; see ‘Latent class” 

Latent character, see “Latent attribute” 

Latent characteristic, 414, 470 

Latent class frequencies: 384, 387-388, 
454; computation of for: latent dichot- 
omy, 415, 425; latent distance scale, 412, 
444: restricted latent trichotomy, 435— 
436, 437 


for Guttman scale, 456; and latent 
marginals, 399 

Latent class position, see ‘Classification 
of respondents” 

Latent class structure: 376-379, 468; of 
Guttman scale, 456 

Latent classes: 6, 19, 376, 380, 383-385, 
388, 430; as discrete simplification of 
continuous trace line, 32, 33, 376-379 

number of: 19, 32, 414, 441, 467-469: 
and basic determinants, 389-392; and 
degree of polynomial trace lines, 467; 
increase of, 32, 397; infinite, see ‘Trace 
lines’’; and number of items, for Gutt- 
man scale, 456; three, see ‘Restricted 
latent trichotomy,” “Latent trichotomy 
analysis’; two, see “Latent dichotomy 
analysis” 

ordering of, which see 

Latent classification, see ‘Classification of 
respondents” 

Latent continuum, see ‘Latent attitude 
continuum’’ 

Latent data: 363, 383; and manifest data, 
364, 369-372, 374, 377, 379-396, 414- 
452, 456, 472 

Latent dichotomy analysis: 19-29, 382- 
384, 414-432; analytic explanation of: 
27-28, 391-392, 393-394, 399-401; for 
3 items, 399-401, 415-416; for 4 items, 
416-431; for 5 items, 391-392; for m 
items, 27-28, 392 

computation of, 422-432; computed 
latent structure of, 28, 427; empirical 
examples of, 21-28, 382-384, 417-431, 
448-449; and latent distance scale, 447; 
latent structure of, 27, 416; and linear 
trace lines, 468-469; schematic example 
of latent structure of, 461; as special 
case of latent structure analysis, 19-20, 
29, 208, 401, 415, 422; of two latent 
dichotomies, see ‘Latent turnover’’ 

Latent distance scale: analytic explanation 
of, 30, 410-412; computed latent struc- 
ture of, 31, 444; conditions of reducibil- 
ity for, 411-412, 442, 443; empirical ex- 
ample of, 441-446, 457; general, 447; 
and latent linearity, 446; latent struc- 
ture of, 410, 442; and ordering of latent 
classes, 446; and perfect Guttman scale, 
30, 455, 457; and restricted latent 
trichotomy, 447 

Latent distribution, among latent classes: 
445-446; bimodal, 440, 446; standard 
deviation of, 446 

Latent fourfold table, 408, 449-450 

Latent linearity: 446-447, 453; analytic 
explanation of, 31-32; definition of, 446; 
and latent turnover, 447, 453; and quasi 
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scales, 458; schematic example of, 447; 
and trace lines, 446-447 
Latent marginals: 384, 387-388, 393, 396, 
401n; and adjusted S-values, 427; and 
factor loadings, 471; and higher order 
cross products, 398; for latent dichot- 
omy, 399-400, 416, 427; for latent dis- 
tance scale, 411-412, 443-444: for latent 
turnover, 408; number of, 385; for re- 
stricted latent trichotomy, 403, 437- 
438; simplified notation for, 397 
Latent parameters: 378, 384, 389, 392, 393, 
411-412, 442; and latent structure equa- 
tions, 385-386; and manifest data, 379- 
396, 414-452; and missing frequencies, 
394; number of, 385 
Latent position, average, 445-446 
Latent probabilities, 376, 378, 429-430 
Latent roots: 320n, 340; see also “Corre- 
lation ratio’; and constant vector, 346; 
distinctness of, 349-351; sum of, 351 
Latent set: 206; and manifest set, 211 
Latent structure: computation of, 413- 
454; defined, 378, 379, 395 
empirical, for: latent dichotomy, (four 
items) 28, 427, (three items) 449; latent 
distance scale, 31, 444; latent turnover, 
452; restricted latent trichotomy, 438 
equivalent: 376-379, 403, 447; and 
trace line model, 466-469 
hypothetical, for: \ classes, m items, 
29; latent dichotomy, (2 items, sche- 
matic) 461, (4 items) 416, (m items) 27; 
latent distance scale, 410, 442; latent 
trichotomy, (four items) 397, (six items) 
385; latent turnover, 408; m noncumu- 
lative items, 31; perfect scale, m items, 
30, 455; quasi scale, m items, 30, 410; 
restricted latent trichotomy, 403, 434 
interpretation of, 413-472; of latent 
dichotomy, numerical computation of, 
422-427 
and manifest data: 379-396; and de- 
generate structures, 395 
of scalogram pattern, 455-460 
Latent structure analysis: see also ‘“‘Latent 
structure theory’’; and accounting equa- 
tions, 369-371, 374, 375, 379, 453-454, 
472; analytic explanation of, 29-30, 
206-207, 368-369, 384-394, 397-412; 
basic matrix equation of, 388, 389; and 
classification of respondents, 430, 438- 
441; and computational short-cuts, 
413n, 471; and conditions of reducibil- 
ity, 393-396; and derivation of specific 
latent structures, 397-412; and the 
dichotomous system, 379-384; general 
background of, 363-379; and the fitted 
response patterns, 22-26, 427-431, 438- 
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441, 444-446; illustrations of, 21-28, 
413-454; and linear trace lines, 373-375, 
414; and logarithms, 419-421, 451; logi- 
cal and mathematical foundation of, 
362-412; and matrices of manifest joint 
frequencies, 387-392; and the most ap- 
propriate segmentalization, 467-469; 
and ordering of latent classes, 380, 384, 
411, 446, 453, 468; and point distribu- 
tions, 376-379, 380, 454; and polynomial 
trace lines, 371-379; and precision, 430- 
431, 440-441; preliminary tests for, 33, 
421-422; and pure tests, 366-370, 377, 
422; purpose of, 380; and scaling, 446; 
and special structures with more than 
two latent classes, 432-454; and test 
construction, 454-472; and trace lines, 
363-379, 395, 402, 414, 432, 446-447; 
and variability, 25 

Latent structure equations: 384-386, 397; 
for latent dichotomy, (3 items) 399-401, 
415-416, (4 items) 423-427; for latent 
distance scale, 411-412, 443-444; and 
latent parameters, 385-386; for latent 
trichotomy, 385, 397-399, 401n; for 
latent turnover, 409-410; and number 
of latent classes, 397; for restricted 
latent trichotomy, 404-408, 434-438 

Latent structure theory: 6, 7, 19-33, 206- 
211, 362-472; see also ‘Latent structure 
analysis”; basic postulate of, 6, 19, 21, 
29, 30, 206, 207, 367, 368, 380, 385, 388, 
454; empirical applications of, 21-28, 
413-452; and factor analysis, 6-7, 19, 
29, 205-210, 384, 422n, 469-472; math- 
ematical development of, 381-396; 
and perfect scale pattern, 210-211; 
and quantitative material, 471-472; and 
quasi scales, 7, 30, 207, 211, 458; and 
reducibility, 381-384, 393-396; and re- 
lation to other techniques, 454-472; 
and sampling theory, 414, 422, 429, 437; 
and scalogram analysis, 7, 20, 22, 30, 
33, 45, 210-211, 381, 455-460; utility of, 
33, 381, 446, 452, 469, 471 

Latent subclasses or subsystems, see ‘‘La~ 
tent classes” 

Latent trichotomy analysis: and addi- 
tional items, 401n; analytic explanation 
of, 384-385, 387-388, 390-392, 393, 397- 
399; conditions of reducibility, 398; 
fully degenerate example of, 396; latent 
structure of, 385, 397; partially degener- 
ate example of, see ‘‘Restricted latent 
trichotomy”; restricted, which see 

Latent turnover analysis: 33, 408-410, . 
422, 447-452; analytic explanation of, 
408-410; computed latent structure of, 
452; and cross product of latent fourfold 
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table, 451; cross-product matrix of, 409; 
empirical example of, 448-452; latent 
structure of, 408; terminology of, 448n, 
450 

Latent vectors: 39n, 316n, 340, 346, 349; 
see also ‘‘Principal components” ; number 
of oscillations of, 353-357; orthogonal- 
ity of, 349-351; oscillatory nature of, 
351-353 

Law of comparative judgment, 12n 

Lazarsfeld, P. F., vi, 3n, 6, 7, 9, 19, 27, 32, 
45, 206, 207, 208, 209, 210, 211, 219n, 
362n, 413n 

Leadership, 167 

Learning theory, 483-485, 690-691 

Least squares: 65, 174, 193-195, 198, 199, 
500, 539; method of scalogram analysis, 
91, 143, 144; to obtain metric in scalo- 
gram analysis, 314, 318 

Legendre polynomials, 357 

Leisure-time activities and smaller instal- 
lations, 715 

Level of difficulty, 14, 15 

Lexical order of joint occurrence matrices, 
387 

Likert, Rensis, 249, 710 

Likert item, example of, 252 

Linear order of latent classes, see ‘Latent 
linearity” 

Linear trace lines: 365-366, 373-375, 402, 
414, 461-463; 468-469; coefficients of, 
374; and distribution function, 374, 468; 
equations of, 374; example of, 365, 463; 
and higher moments, 375; and manifest 
data, 374; and ordering of response pat- 
terns, 465; and rank of matrix, 373; for 
standardized distribution function, 375, 
465, 466 

Link: defined, 391, 423-424; as depending 
on signature of matrix, 391-393; in 
latent dichotomy, 400, 415, 417, 423; 
in latent trichotomy, 401n; multiplica- 
bility of, 391-392, 393; and rank of joint 
occurrence matrices, 393; and ratio of 
cross products, 405 

Liouville, Joseph, 357 

Literacy and NSA, 550n 

L-matrix, 387-388, 390 

Loevinger, Jane, 4n 

Logarithms, utility of in latent structure 
analysis, 419-421, 451 

“Low” score, in psychiatric areas, de- 
fined, 495 

LST, see ‘Latent structure theory” 

Lyon, Blanchard, vii, 413n 


M 


M as matrix, 386-394 
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M as moment, see “Moments of distribu- 
tion function” 

Machine Records Branch of AGO, 717 

Magnet, use of, 96, 114 

Mail censorship, attitude toward, 158 

Mail follow-up questionnaire, returns 
from, 586-587 

Malingering, 479, 489, 504 

Manifest data: 363, 368, 383, 395, 418; 
and degree of trace lines, 372, 373; and 
latent data, 364, 369-372, 374, 377, 378, 
379-396, 414-452, 456, 472; and poly- 
nomial trace lines, 373-375; in trichot- 
omous form, 471-472 

Manifest frequencies or marginals, see 
“Marginal frequencies” 

Manifest joint occurrence matrix M, 386, 
393 

Marginal frequencies: 369, 372, 384; com- 
puted from latent marginals, 461; diver- 
sity of, as related to number of rank 
groups, 114, 133; in joint occurrence 
matrix, 386; and moments of distribu- 
tion function, 371, 372; as proportions, 
372, 374n 

Marginal lower bound to reliability coeffi- 
cient, 304-305 

Marital status, distribution of: for normals 
and neurotics, 513; in U.S. and overseas, 
716 

Market research, 18 

Marriage: and neurosis, 513; and post- 
separation interest in education, 617- 
618, 691 

Marriage research, 60 

Mass communication, v 

Material shortages, 482, 657, 666 

Mathematical models in factor analysis 
and latent structure analysis, 469 

Matric inversion by differencing: for 
scores, 344-347; for weights, 347~349 

Matrix of: higher-order joint positive fre- 
quencies: 373, 386-387; arranged in 
lexical order, 387; and basic determi- 
nants, 389-394; and factor analysis, 471 

joint positive frequencies: 372, 386, 
470; and basic determinants, 389-394; 
and the latent structure, 387-392; rank 
of, which see 
nonscalable ares, 341; point intercor- 

relations, 202; a scale of dichotomies, 
334 

Matrix algebra, 387-394 

Matrix equation of latent structure anal- 
ysis, 388 

Maximum class, 446 

Maximum probability, to predict qualita- 
tive variates, 194 
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Maximum probability score, for ordering 
response patterns, 462, 464, 465 
McNeomar, Quinn, 47n, 2138, 214n, 710 
McPesk, William, 568n 
Mean, arithmetic: 193, 220; and ordering 
of response patterns, 445~446; and qual- 
itative variates, 194; and quantitative 
variates, 195 
Meaning of single item in polling, 87, 151 
Measurement: 3—472; problems of, 46-59 
Measurement of error, 77-80 
Medial items, and latent distance scale, 
442 
Median, to rank response patterns, 445 
Median content percentile of respondents 
with lowest intensity scores, 217, 227, 
229 
Median of intensity percentile: defined, 
220, 221, 226; sampling error of, 302 
Medical and administrative rejection rate 
for NSA scores: 562; and critical signs, 
563 
Medical and physical rejection rate: 550; 
for induction stations and service com- 
mands, 551-552 
Medical rejects: and NSA scores and criti- 
cal signs, 566 
screened by NSA: 476, 558; for induc- 
tion stations and service commands, 
556-557 
Mental testing, see ‘Intelligence measure- 
ment” 
Merton, R. K., vn 
Methodology in attitude research, bibli- 
ography of, 47n 
Metric: for categories, 327-329; for cate- 
gories vs. for people, 315 
internal: 313, 326, 330; and equations 
of scale analysis, 314; and rank scores, 
325-326 
for people: 317-327, 331; and rank 
order, 318 
problem of, 65, 312-313 
Metric component: 321, 323, 341; and 
other components, 329-330, 333, 353n; 
and regression weights, 361; scores, 36, 
323, 325; shape of curves, 323, 324, 325, 
353n, 356; shape of curve for weights, 
328 
Middle item, 365, 447, 459 
Middle West of U.S.: and NSA scores, 
558; and variations in psychiatric diag- 
noses, 475; and variations in psychiatric 
rejection rate, 554 
Midpoint of content percentile, defined, 
220, 221, 226 
Minimum intensity point, see ‘Zero point” 
Missing frequencies: 386, 387, 394; nu- 
merical value of, and reliability of, 304 
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M-matrix, 386-391, 393 

Mobility (geographical) as a psychiatric 
area: 525-526; correlation of with cri- 
terion, 497; defined, 493, 525; discrimi- 
nating value of, 479, 495, 525; list of 
items for, 625-526; and multiple regres- 
sion weights, 498; and other areas, 496, 
497; as scale, 493; scores for normals 
and neurotics, 525 

Modal category, 303 

Modal probability: 303-304, 429-430; 
average, 304-305, 306; and bimodal 
types, 440, 445, 468; lower bound for 
average, 304; in perfect Guttman 
scale, 456 

Mode: and prediction of qualitative vari- 
ate, 194, 199, 287-288, 308; to rank re- 
sponse patterns, 445 

Moments of distribution function: 371- 
372, 465n; and item marginals, 371, 374; 
and linear trace lines, 374, 375, 465-466; 
and polynomial trace lines, 375, 469 

Morale: 163; example of item, 259, 292, 
293 

Mosteller, Frederick, vii, 12, 175n, 429, 
466n, 709n, 710 

MP3, see “Maximum probability score” 

Multiple correlation, 55, 56, 65, 78, 79, 89, 
156, 159, 162, 175, 178, 188, 186, 192, 
361, 493, 494, 495, 498n, 500, 547, 686 

Multiple dimensions: 6, 9, 18, 45, 55, 129, 
131, 145, 150, 154, 159, 160, 188, 189, 
190, 191, 305, 367, 368, 380, 422, 572- 
573, 585, 595, 624-625, 631; and latent 
turnover, 408, 453; and meaning of cut- 
ting point, 48-49 

Multiple factor analysis, 18, 191, 206, 209 

Multivariate distribution, empirical vs. 
perfect, in scalogram, 77; see also “‘Co- 
efficient of reproducibility” 

Murphy, Gardner, 47n, 61, 62n, 63 

Murphy, L. B., 47n, 61, 62n, 63 

mu-value (x), see “Link” 


N 


Nail biting: distribution for normals and 
neurotics, 519, 537; and other psychoso- 
matic complaints, 542 

Narcotics, as a critical sign, 511n 

National Research Council, 722 

“Negative” responses, treatment of in 
scalogram analysis, 100, 202, 105 

“Negative-neutral’”’ responses, treatment 
of in scalogram analysis, 103, 105 

Negro soldiers: and interviewing, 720; 
Negro vs. white interviewers, 53, 720, 
721; and sampling, 714 
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Nervous breakdown: as a critical sign, 
51in; of family member, and neurosis, 
514 

Nervousness: and neurosis, 536, 537, 538; 
and other psychosomatic complaints, 
541, 542, 543; and Rorschach tests, 4790 

Neurologically unfit, 511n 

Neuropsychiatric Consultants’ Division of 
SGO, 486n, 549 

Neuropsychiatric Screening Adjunct: 
adaptation of to induction station 
screening, 502-512; administration of, 
512; defined, 511; derivation of, 489- 
492, 499, 511-512; list of items for, 511n, 
535-538; and logic and procedure, 486- 
512; original design of, 489n; and other 
areas of psychiatric import, 492-499; 
and psychiatric examination, 548-549, 
558-559 

scores on: and education, 561; fre- 
quency distribution of, 474-475, 566, 
(for pretest) 501, 508; and preinduction 
examinees, 561, 562, 566, (and critical 
signs) 562, 563, 566; and psychiatric 
diagnoses, 548-549, 555-561, (and criti- 
cal signs) 564, 566 

scoring of, 499-501 

screening rate: 555, 558; and all types 
of psychiatric rejects, 559; for each type 
of psychiatric rejects, 475, 558; for in- 
ductees, 476, 558, 565; for induction 
stations and service commands for each 
and all types of rejects and inductees, 
556-557, 560; for medical and physical 
rejects, 476, 558; for psychiatric rejects, 
475, 558, 565, (and psychiatric diag- 
nosis rate) 559n; and psychiatric screen- 
ing rate, 476, 558, 560-561; range among 
induction stations, 555; variability 
among regions of the U.S., 561 

use of, 474, 486, 489, 508, 512, 548- 
549, 567; and verbal behavior, 559 

Neurosis, see ‘‘Psychoneurosis”’ 

Neurotic inventory, see ‘Psychosomatic 
complaints” 

Neurotic symptoms in childhood, see 
“Childhood neurotic symptoms” 

‘Neutral’ responses: and linear trace 
lines, 469; treatment of in scalogram 
analysis, 103, 105, 106, 111, 113 

“Neutral-neutral’”’ responses, treatment 
of in scalogram analysis, 103; sce also 
“Neutral’’ responses 

Neutrality: point of, see ‘‘Zero point”; 
range of, see ‘‘Zone of relative indiffer- 
ence” 

New England: and NSA scores, 558; and 
variations in psychiatric diagnoses, 475, 
554 


Newcomb, T. M., 47n, 61, 62n, 63 

Nightmares: and neurosis, 538; and other 
psychosomatic complaints, 543 

1948 Presidential election, 676 

N-Matrix, 387-388, 390 

“No answers,’”’ treatment of: in psychiatric 
survey, 512; in scalogram analysis, 100, 
103, 153 

“No opinion,” treatment of: in latent 
structure analysis, see ‘ ‘Don’t know’ 
responses’; in scalogram analysis, see 
‘* ‘Neutral-neutral’ responses’ 

Noland, E. W., 90 

Nomograph relating K and t, 426 

Nonads of cross products, 401n 

Nonattitudinal variables and scaling, 172 

Non-body-contact sports in boyhood and 
neurosis, 522-523 

Noncoms [Noncommissioned officers]: and 
attitude toward the WAC, 243, 245; and 
privates, 243; and satisfaction with 
one’s Army job, 243, 247 

Nonconstant principal components, num- 
ber of, 322, 325 

Noncritical scores, 494n, 501n, 503, 508 

Noncumulative items: 11, 15, 31; permit- 
ting ordering of respondents, 31, 32 

Nondirective interviewing, 677 

Nondirective items, see “Items, write-in” 

Nonlinearity of latent classes, 447, 453 

Nonnormal distributions and quantitative 
variables, 209-210 

Nonpsychoneurotic adjustment, in areas 
of psychiatric import, defined, 492-493 

Nonpsychoneurotics, see ‘‘Cross section of 
Army” 

Nonresponse, and AGCT, education, and 
race, 719 

Nonscalable areas: and addition of items, 
180, 300; and discrimination between 
ranks, 184; and equations of scale anal- 
ysis, 341; examples of, 145, 158, 176, 
291-293; and index, 175-181; and inten- 
sity analysis, 221; prediction in, 156, 
174, 178-179, 180, 186; and principal 
components, 39, 320, 332-333, 341, 360; 
and quasi scales, 159-163; rank order 
in, 154, 176; reproducibility of responses 
in, 176, 184, 288-290; and sampling of 
items, 177, 179, 180, 300, 333; and scal- 
able areas, 158, 159, 175, 176, 179, 332- 
333; scalogram analysis of, 288-295; 
treatment of, 173 

Nonscale items: 84, 85, 128, 129, 131, 132, 
158, 166, 168; examples of, 160, 161, 
289; and prediction, 480, 686 

Nonscale response patterns, 446 

Nonscale and scale approaches, 172-174 

Nonscale types: 6, 39, 78, 119, 145, 159, 
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160, 162, 169, 176, 207, 446, 599, 641; 
examples of: 145, 146, 148, 149, 150, 
198-199 

Nontrivial principal components, see 
“Nonconstant principal components” 

Nonverbal behavior, as attitude, 48 

Normal bivariate distribution, 195, 196, 
197, 200 

Normal distribution: 192, 208, 209, 428; 
and paired comparisons, 191; and quali- 
tative data, 195; and scalogram anal- 
ysis, 71n, 195, 198 

NSA, see “Neuropsychiatric Screening 
Adjunct” 

nu-values (y), defined, 406, 437 


O 


Observations, methods of, 86-88 

Observed frequencies, see ‘‘Marginal fre- 
quencies” 

Occupational plans, see ‘Postwar plans for 
employment” 

Odd-jobs man, as an own-account worker, 
646, 663 

Officers, attitude toward, see “Attitude 
toward officers” 

Officers, noncommissioned, see ‘“Noncoms”’ 

Ogburn, W. F., 692n 

On-the-job training: number enrolled 
(1945, 1946), 622n; postseparation in- 
terest in, which see 

Opinion measurement, 46-59 

Opinion research: 46-59, 61, 80, 83, 86, 88, 
121, 155, 168, 169, 172, 173, 216n; bibli- 
ography of, 4n, 47n; and the intensity 
component, 213-276 

Opinions vs. attitudes: 47-48 

Optimum weights or scores, 36, 315-316, 
339, 341 

Order of areas of psychiatric problems: 
479; and correlation with criterion, 497 

Order of presentation: of items, 80, 214, 
215, 584, 677-678; of response cate- 
gories, 80, 152, 214 

Order of scale scores, 76, 89 

Ordering of items: 11, 15, 62, 185, 465; in 
equal-appearing intervals method, as re- 
stricted to dichotomous items, 185; by 
judges, 185; in latent distance scale, 442; 
in paired comparisons, 191 

in scalogram analysis: in order of dif- 

ficulty, 70; in order of extremeness or 
favorableness, 73, 76, 88-89, 93, 98-99, 
100, 144; as restricted to dichotomous 
items, 62 

Ordering of latent classes, 380, 384, 411, 
446, 453, 468 


Ordering of latent dichotomies, 447 
Ordering of respondents: 8, 60, 61, 62, 176, 
215, 363, 458, 465, 684, 687; by broad 
groups of response patterns, 26, 27, 296, 
300-301; in latent dichotomy analysis, 
20, 23, 25, 429-431, (stability of) 25 
in latent structure analysis: by broad 
groups of response patterns, 26, 27, 439- 
440, 445; if X > two, 30, 31, 208, 440, 
445-446 
in nonscalable areas, 154, 176, 446; 
and ordering of items, 62, 88 
in postwar plans studies: 684-688; 
and contingencies, 685-686; and post- 
separation performance, 684 
in scalogram analysis: 9-12, 15, 16, 
34, 67, 69, 70, 71n, 73, 80, 81, 82, 88, 
97, 133, 189, 191, 213, 275, 296, 312, 
583, 595, (Step 2) 105-108, (Step 4) 
115, 118; by broad groups of response 
patterns, 296, 300-301, 610, 684; con- 
stancy of, and number of items, 299- 
300; constancy of for both sample and 
universe, 81, 89, 151, 153, 154, 163, 177, 
216, 218, 295-296, 298, 300; as a formal 
consequence of response configuration, 
68; for nonscale and scale populations, 
88, 169; in order of favorableness, 73, 
76, 89, 93, 99, 100, 105, 106, 115, 144, 
157, 169, 175, 213, 222; in order or in- 
tensity, 40, 157, 216, 222; in order of 
knowledge, 70, 141; in quasi scales, 162, 
218; and “right” answer, 66-68; and 
sampling of items, 298-299; within each 
scale type, 106, 108; scalogram of, 107; 
by simple weights, 105, 106, 108; and 
“solid streaks,” 105, 108 
Ordering of response categories: 62, 68, 74, 
75, 89; between items vs. within items, 
62; in “cafeteria” items, 187-188 
in scalogram analysis: by a priori 
means, to save labor, 102; in initial 
arrangement of first scalogram board, 
102, 103; by second scalogram board, 
97, 100, 114, 115 
Ordering of response patterns: 20, 22, 23, 
428-431, 439-440, 441, 446, 460-465, 
468, 469; vs. conventional item analysis, 
26n; and latent distance scale, 31, 445- 
446 
Ordering of variables, conditional, 212 
Orientation of individual, 579 
Origin situation: 483-484, 689-692; as 
artificiality, 692 
Orthogonal functions and principal com- 
ponents, 357-359 
Orthogonality of latent vectors, 349-351, 
354, 359, 361 
Osborn, Maj. Gen. Frederick, 710, 711 
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Oscillations of principal components: 
number of, 353, 356; for scores, 323— 
325, 351-356, 357; and size of circle, 
353-357; for weights, 329, 356-357 

Overseas, length of time: and demobiliza- 
tion, 189; distribution of in U.S. and 
overseas, 716 

Overseas assignment, and psychiatric dis- 
orders, 490n 

Oversensitivity: 531-533; correlation of 
with criterion, 497; defined, 493, 531; 
discriminating value of, 479, 495, 531; 
list of items for, 531-533; and multiple 
regression weights, 498; and other areas, 
496, 497; as scale, 493; scores for nor- 
mals and neurotics, 531 

Own-account workers: 646, 647, 660; num- 
ber of, and number of business firms, 
646-647 


Pace, C. R., 86 

Pacific Coast and variations in: psychia- 
tric diagnoses, 475; psychiatric rejection 
rate, 554 

Paired comparisons: 12, 189-191; and 
Army Score Card Plan, 189; and aver- 
age comparison of attitudes, 190; and 
multiple dimensions, 191; and normal 
distribution, 191; and reproducibility, 
190; and scalogram analysis, 189-191 

Parabolic trace lines: 371-372, 401-407, 
432; convex and concave, defined, 402; 
terminology of, 403 

Parachute jump, worries in anticipation 
of, 145 

Parallelogram pattern, in scalogram: 67, 
70, 73, 76, 89, 91, 93, 97, 99, 108, 115, 
120, 144, 152, 153, 157, 162, 207, 212, 
312; defined analytically, 334-335 

Parents, influence of, and neurosis, 514- 
515, 524 

Partial correlation, 192, 470 

Participation in sports: 522-523; correla- 
tion of with criterion, 497; defined, 492; 
discriminating value of, 479, 495, 522; 
list of sports, 522-523; and multiple re- 
gression weights, 498; number of sports 
for normals and neurotics, 522; and 
other areas, 496, 497; as quasi scale, 
493-494; type of sports for normals and 
neurotics, 522 

Pattern of errors, as criterion for scalabil- 
ity, 78, 79, 119 

Patterns of cultural change, as predictions, 
574 

Pearson, Karl, 471 
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Pearson r, 410, 470 

Peddler, as businessman, 660 

Percentile order of respondents, in inten- 
sity analysis, see ‘Intensity percentile,” 
“Content percentile” 

Perfect Guttman scale: chance occurrence 
of, 82; existence of in actual practice, 
5, 9, 63, 64, 89, 157-159, 198, 279, 286, 
456; fit of, 456-457; implications of, 66, 
211, 296, 312, 314, 455; and latent dis- 
tance scale, 455; latent structure of, 30, 
455, 457; mathematical analysis of, 334— 
361; practical approximations to, 64, 77, 
89, 122, 158, 198, 278-279, 286-287; pre- 
cision of, 456, 457; trace lines of, 455- 
456 

Perfect reliability, 303, 309 

Perfect unreliability, 302, 300 

Performance classification: for business 
ownership, 658n; for education, 604, 
609; for government employment, 672; 
for return to previous employer, 630 

Personal adjustment: 533-535; and con- 
text, 677n; correlation of with crite- 
rion, 497; defined, 493, 533; discriminat- 
ing value of, 479, 495, 533; list of items 
for, 534-535; and multiple regreesion 
weights, 498; and other areas, 496, 497; 
prediction of, 59; as acale, 493; scores 
for normals and neurotics, 533 

Personal commitment, see “Identification 
with the war effort’”’ 

Personal esprit: see “‘Morale”’; and neuv- 
rosis, 534 

Personal involvement in war, and neu- 
rosis, 528, 529 

Personality development: areas of, 492- 
493; correlation between areas of, 496, 
497 

Personality inventories, 61, 162, 493 

Personnel accounting in Army, 717 

Phi-coefficient, 410 

Physical and administrative rejection rate 
for NSA scores: 562; and critical signs, 
563 

Physical and medical rejection rate: 550; 
for induction stations and service com- 
mands, 551-552 

Physical reactions to battle, 13, 142 

Physical rejects: and NSA scores and criti- 
cal signs, 566 

screened by NSA: 476, 558; for induc- 
tion stations and service commands, 
556, 557 

Physician, see “‘Psychiatriste”’ 

Physiological continuum, 142 

Physiological fear symptoms, 13, 142 

Pivotal condensation of determinants, 
393-394 
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Plans classification: 579, 686, 689; for 
business ownership, 650-654, 660, 663- 
665; consistency of, 635, 689; for educa- 
tion, 598-599; for farming, 640; for gov- 
ernment employment, 669, 670-671; by 
interviewing, 582; and multiple dimen- 
sions, 572-573, 625, 686; for return to 
previous employer, 625-627, 634; and 
return to school, 608-609, 610-617; by 
single item, 613-617, 681, 687, 688; of 
undecided respondents, 578-579, 681 

Plans for postwar, see ‘Postwar plans” 

Plans and reality, see ‘‘Realiam of postwar 
plans” 

Point correlation coefficients: 201-203; 
and factor analysis, 201-203, 470; for- 
mula for scale items, 203; of latent four- 
fold table, 410; matrix of, 202; and 
scalogram analysis, 70, 71, 202, 203; 
tetrads of, 202 

Point distribution: 376-379, 380, 467; with 
nine points, 454; and nonlinear trace 
lines, 469; number of points and degree 
of polynomial, 376, 377, 378, 395; with 
three points, 376-377; with two points, 
466 

Point of intersection of scale scores, 510; 
see also “Cutting point, on psychoso- 
matic complaints scores” 

Point system for demobilization, see 
“Army Score Card Plan” 

Poisson distribution and principal compo- 
nents, 325 

Poisson-Charlier polynomials, 358 

Polling vs. scaling, 48-49 

Polygonal trace lines, 377 

Polynomial trace lines: 371-375, 376, 378, 
447; cubic, 373, 375; degree of, 372, 373, 
377, 378, 395, 467; and factor analysis, 
470-471; and Guttman scale, 456; 
linear, see “Linear trace lines’; and 
manifest data, 373-375; and moments 
of distribution function, 372; and order- 
ing of response patterns, 465; parabolic 
(quadratic), see ‘Parabolic trace lines’’; 
and point distributions, 32-33, 377-379, 
469 

Population, defined, 80n 

Population characteristics, see ‘Back- 
ground characteristics” 

Population distribution of respondents, see 
“Distribution function of respondents” 

Populations differing in kind rather than 
in degree, 145, 169 

Position of items, see “Ordering of items’’ 

Position scales, 11, 12 

“Positive” responses: in latent structure 
analysis, 365, 402n; treatment of in scal- 
Ogram analysis, 100, 102-103, 105, 110 
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“Positive-neutral” responses, treatment of 
in scalogram analysis, 103, 105, 110 
Postal service, and veteran employment, 
673 
Postdemobilization performance, see 
“Postseparation activity” 
Postseparation activity: and business own- 
ership, 658-666; and education, 603— 
622; and farming, 643-644; and govern- 
ment employment, 672-675; list of items 
for, 706-708; and previous experience, 
645; and return to previous employer, 
630-639; and time of separation, 609 
Postseparation business ownership: and 
available savings, 662; and difficulties 
encountered, 665-666; and previous ex- 
perience, 662, 691 
Postseparation interest in: business owner- 
ship: (July 1945), 658, 659; items on, 
707-708; and postwar plans, (July 1945) 
659, 661, (Dec. 1945) 661, 662 664 
education, full-time: (July, Dec., 
1945), 605; and background character- 
istics (Dec. 1945), 618; and certainty of 
postwar plans, 616, 617; item on, 707; 
and postwar plans, (July 1945) 607, 609, 
613, 615, 616, (Dec. 1945) 608, 609, 611, 
613, 615, 616, 621 
education, part-time: and postwar 
plans, (Dec. 1945) 611, 612n, 620-621; 
and postwar plans for full-time educa- 
tion, 604n, 621, 694 
governmental employment: (July 
1945) 672; item on, 707; and postwar 
plans, (July 1945) 673, 674 
on-the-job training, and postwar 
plans for full-time education, 604n, 621 
return to previous employer: (July, 
Dec. 1945) 630; and certainty of post- 
war plans, 634; and duration of previous 
employment, 636; item on, 706-707; for 
July 1945 vs. Dec. 1945 separatees, 629, 
631-633, 638-639, 691; and occupational 
level, 638; and postwar plans, (July 
1945) 630, 632, (Dec. 1945) 630-631, 
633; and type of employer, 637-638 
Postseparation return to farming: item 
on, 707; and postwar plans (July 1945), 
643, 644; and previous experience, 644 
Postwar plans (of soldiers): 477-478, 479- 
482, 568-708; (summer 1944), 597-598; 
(July and Dec. 1945), 613, 616; and see 
below under separate headings; and al- 
ternatives, 572; analysis of, 584; for 
Army training (example of item), 291; 
background of, 571-573; certainty of, 
616-617, 634; and change in situation, 
570, 579, 612; as complex, 676-677; and 
demand for facilities, 578, 586; as de- 
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pressing, 677n; and follow-up studies, 
586-587; interpretation of by consum- 
ers, 577; lists of items for, 481-482, 652n, 
698-705; for migration, nonscalability 
of, 584; and performance, 580-581, 586, 
603-622, 631-639, 643-644, 658-666, 
672-675; and prediction, 578-581, 600, 
689-695; and preliminary interview, 
581-583, 676; pretests of, 570-572, 577— 
579, 583-585, 640, 647-648, 650-654, 
669-670, 677n, 678, 679, 680-681, 682; 
and previous experience, 645, 691; real- 
ism of, which see; and scalogram anal- 
ysis, 478; suocess of prediction, 482, 586, 
570; for vacation, success of prediction, 
694 
Postwar plans for business ownership: 
645-667; (spring 1944), 648; (summer 
1944), 597, 649, 654, 656; (summer 
1945), 657; and competition, 655-657; 
definite, defined, 649-650, 660n; and 
GI loans, 645-646, 647, 656, 665; and 
government agencies, 569; item lists for, 
648, 649, 700-701; and material short- 
ages, 482, 657; and postseparation inter- 
est in ‘education, 616; and postwar 
plans pretest, 584, 585, 647-648, 650- 
654; as a prestige item, 482 
of separatees: (July 1945), 657, 661; 
(Dec. 1945), 658, 661; and postsepara- 
tion performance, (July 1945) 659, 661, 
(Dec. 1945) 661, 662, 664 
and size of sample, 582-583; success 
of prediction, 482, 570, 657, 658, 661- 
662, 664 
Postwar plans for education: 480, 587-622; 
and employment, 480, 481, 482, 589, 
601, 679, 691; without Federal aid, 588— 
597, 600; full-time, see below; and gov- 
ernment agencies, 569; item lists for, 
588-590, 597-598, 680-681, 701-702; 
- and Negroes, 599n; and officers, 599n, 
600; and ordering of respondents, 685 
part-time: (summer 1943), 590; (sum- 
mer 1944), 598; and full-time education, 
(summer 1943) 589, 592-593, (Dec. 
1945) 612, 621; and government aid, 
598; and postseparation performance, 
611, 612n, 620-621 
and performance, 603-622; in postwar 
plans pretest, 583, 585; and reliability 
of coefficient of reproducibility, 282- 
283; as a scale, 593-595, 598-599; and 
separation, 602-603, 619-620; success of 
prediction, 482, 570, 605-606, 614, 621, 
622; and write-in item, 679 
Postwar plans for education, full-time: 
(summer 1943), 588, 590, 592-593, 595- 
596; (summer 1944), 598, 599, 600, 601; 
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(summer 1945), 601, 602; definite and 
tentative, defined, 599; and educational 
level (summer 1943), 594; and govern- 
ment aid, (summer 1943) 588, 589, 590- 
592, 638-684, (summer 1944) 600; and 
government aid and jobs (summer 1943), 
589, 590, 591; interest in, (summer 1943) 
588, 590, (summer 1944) 598; and jobs 
(summer 1943), 589, 590-592; and 
length of war (summer 1943), 589, 590, 
594; and part-time education (summer 
1943), 589, 592-593 

of separatees: (July and Dec. 1945), 
602, 603; (July 1946), 602; definite and 
tentative, 607, 612-614, 684n; and post- 
separation performance, (July 1945) 
607, 609, 613, 615, 616, (Dec. 1945) 608, 
609, 611, 613, 615, 616, 621 


Postwar plans for employment: 572-573; 


(summer 1944), 597-598, 638; and edu- 
cation, 480-481, 589, 601, 616, 679; with 
former employer, see ‘‘Postwar plans to 
return to previous employer”; and the 
labor market, 478; lists of items for, 292, 
481, 699; in postwar plans pretest, 583, 
679; success of prediction, 482, 570 


Postwar plans for farming: 639-645; (sum- 


mer 1944), 641; definite, defined, 641; 
and duration of preinduction farming, 
642; and government agencies, 569; and 
illiterates, 643; list of items for, 702-703; 
and plans for ownership of farms, 642; 
‘and postseparation interest in educa- 
tion, 616; in postwar plans pretest, 584; 
as a scale, 584; of separatees, and post- 
separation performance, 643, 644; suc- 
cess of prediction, 482, 570, 643 


Postwar plans for governmental employ- 


ment: 667-675; (summer 1944), 597, 
668-669; definite, defined, 669, 670-671; 
list of items for, 669-670, 702; and poat- 
separation performance, 673, 674; and 
postwar plans pretest, 669-670; of sepa- 
ratees, 671; success of prediction, 482, 
570, 674, 675 

Postwar plans of Negro soldiers, and illit- 
eracy, 643 

Postwar plans to return to previous em- 
ployer: 623-639; (Dec. 1943), 624-625; 
(summer 1944), 626, 628; definite, de- 
fined, 627, 634; and government agen- 
cies, 569; lists of items for, 481, 625n- 
626n, 699; and occupational level, 638; 
and postseparation interest in educa- 
tion, 616 

of separatees: 628, 631n, 634; and 

duration of previous employment, 629; 
and occupational level (July 1945), 638; 
and postseparation performance, (July 
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1945) 632, (Dec. 1945) 633; and type of 
employer, 637 
success of prediction, 570, 630-631; 

tentative, defined, 627, 634 

Postwar psychological conditions, 473 

Precision, in latent structure analysis: 
430-431, 440-441; and addition of 
items, 431, 441; and fit, 431; for latent 
dichotomy example, 27, 431; for perfect 
Guttman scale, 456; and reproducibil- 
ity, 456; for restricted latent trichotomy 
example, 441 

Predictability: see also “Prediction”; and 
intervening situations, 484, 690-692; 
and learning theory, 484, 690-691; as 
measured by correlation coefficients, 50, 
71n, 198; as measured by mode, 194— 
195, 287-288, 308; of multivariage dis- 
tribution from univariate distributions, 
in perfect Guttman scale, 69, 71, 74, 76; 
in nonscalable area, as underestimated 
by arbitrary index, 178-179, 180, 186; 
and nonscale items, 480, 685-686; of 
outside variable from scale scores, 89, 
155, 160, 162, 175, 179, 182, 301, 360- 
361; of psychiatric screening tests, 567; 
and unconscious elements, 483-484, 690; 
and verbal ability, 484, 690 

Prediction: 173-174, 178-181, 288, 312, 
361, 473-695; absolute, 573, 574; actu- 
arial, 573, 574, 694; of attempts to fulfill 
plans, 578, 580, 670; and background 
characteristics, 569-570, 573, 618-619, 
678; of business ownership by trends, 
676; classification of types of, 574; and 
composite, 56; as a correlate to atti- 
tudes, 50, 51, 56; and detection, 489; 
of economists, 478, 571, 606, 655; and 
frame of reference, 573, 679, 687; and 
generalizations, 676, 695; for groups, 
574, 575; imperfect, 56, 312; for the 
individual, 574-575, 576, 607-622, 627, 
631-633, 644, 658-661, 664, 673-675, 
688-689, 695; by individual items, 612- 
617; and large-scale social planning, 
675; and metrics, 312; and military exi- 
gencies, 473; of number of men staying 
in Regular Army, 133, 135, 137, 139; 
and origin, test, and criterion situations, 
483-485, 689-693; of outside variable 
by equal-appearing intervals, 187; of 
outside variable by item analysis, 181- 
182, 183; perfect, 56, 312; of perform- 
ance level, 574, 575; and plans, 578- 
581, 608-610, 654, 689-695; of post- 
war business ownership, 645-667; of 
postwar education, 587-622; of postwar 
employment, 480; of postwar farming, 
639-645; of postwar governmental em- 
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ployment, 667-675; of postwar plans: 
568-695; of postwar psychological con- 
ditions, 473; of postwar return to previ- 
ous employer, 623-639; and primary 
activities, 693-694; problems of, 474, 
477, 481-482, 502-508, 567, 568-581, 
593, 605, 610, 631, 643, 646-648, 655, 
657, 688-689; and programs of agencies, 
578, 647; of psychiatric rejections, 486— 
567; of psychiatrists, 490, 507-508; of 
psychiatrists’ behavior, 477, 489, 549; 
of qualitative variables, 173, 174, 194, 
308, 495n; of quantitative variables, 
178, 308; and scales, 684-688; of school 
attendance, (summer 1943) 596, (sum- 
mer 1945) 605, 606, (by trends) 575~ 
576; success of, 482, 483-485, 568, 570, 
605-606, 614, 619, 620, 622, 630-631, 
639, 643, 658, 664, 666-667, 684, 690- 
695; for total Army population at time 
of survey, 576-577; by trends, 575-576; 
types of, 573-575; use of, 568, 577, 647, 
655; and validations, 580, 635 

Preinduction examinees: disposition of: 
549-555; and NSA score, 562; and NSA 
score and critical signs, 563 

and NSA scores and critical signs, 566 

screened by NSA: 555, 558; for induc- 
tion stations and service commands, 
556-557, 560 

Prejudice, 459, 467 

Premuitiplication of matrices, 388 

Preseparation plans, see ‘Postwar plans’’ 

Prestige occupations: 482, 677-678, 681, 
682, 687, 694; rate of return to, 638 

Pretest: for NSA, see “Psychiatric test 
pretest” ; for postwar plans studies, 583— 
585; use of, to determine scalability, 79, 
158, 283 

Pride in one’s outfit, 163 

Principal axes: 39n, 316n; see “Principal 
components” 

Principal components: 5, 7, 33-44, 203- 
204, 312-361; see also “Closure compo- 
nent,” ‘Intensity component,” “Metric 
component” 

and alternating signs of bending 
points, 351-353; and conditionally or- 
dered variables, 212; and constant com- 
ponent, 322, 328, 332, 340, 345, 353n, 
356; and correlation ratios vs. number 
of oscillations, 353-357; as curvilinear 
functions of scale scores, 39, 204, 217, 
326, 329, 330, 333; and difference equa- 
tions, 341-349; empirical study of, 287n, 
333; example of, 35-39, 317-333; and 
factor analysis, 313, 330-331; and item 
responses, 330-333; linear correlation 
between, 39, 204, 329-330, 349; mathe- 
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matical analysis of, 334-361; and non- 
scalable areas, 39, 320, 332-333, 341, 
360; number of, 320, 322, 325, 340, 346; 
and oscillation, 351-357; and reproduci- 
bility, 331-333, 359-360; and sampling 
of items, 326-327; shape of curves, 34, 
39, 204, 287n, 323-326; shape of curves 
for weights, 328-329; and Tchebychef’s 
solution, 358; and unidimensionality, 
330-331 

Principal component scores, see ‘‘Principal 
components,”’ “Scores for persons” 

Principal component weights, see ‘“Prin- 
cipal components,” “Weights for items” 

“Private” response of Negroes to Negro 
interviewers, 721n 

Privates: and attitude toward the WAC, 
243, 244; and noncoms, 243 

and satisfaction with one’s Army job: 
243, 246; among high school graduates, 
243, 248 

Probability of positive response, see 
“Trace line’ 

Probability sampling, see “Sampling of 
people” 

Product-moment correlation coefficients: 
192, 198, 200, 201, 298, 209, 329, 495; 
and qualitative variates, 194; and quan- 
titative variates, 195, 196, 197 

Profile of scale scores, 175 

Projection of trends, 575-576, 587, 666, 
688 


Projective items, 459-460 

Projective techniques, 479 

Propaganda, effectiveness of, 170 

Psychiatric diagnoses: 486, 549-561, 567; 
and areas of psychiatric import, 492, 
493; and Army hospitals, 491; and criti- 
cal signs and NSA scores, 564; and NSA 
scores, 475, 477, 548-549, 555-567; and 
psychiatric patients, 490n, 507; and 
psychiatric predictions, 507; of psychia- 
tric rejects, 553-554; and scalogram 
analysis, 478; variability in, 475, 476— 
477, 550, 552, 554-555, 558, 565 

Psychiatric diagnosis rate (for every class 
of psychiatric rejects) and NSA screen- 
ing rate, 559n 

Psychiatric diagnosis rate (‘‘others’”’) (% 
of total psychiatric rejects): 553; for 
induction stations and service com- 
mands, 553-554; for NSA scores and 
critical signs, 564 

Psychiatric discharge rate (% of inductees 
later discharged for psychiatric reasons) 
and psychiatric rejection rate, 507 

Psychiatric examination and NSA, 548— 
549, 555-567 


Psychiatric problem areas, see “Areas of 


psychiatric problems” 


Psychiatric rate, in Army, and education, 
561 


Psychiatric rejection, reasons for, see 


“Psychiatric diagnoses” 


Psychiatric rejection rate (% of preinduc- 


tion examinees rejected for any psychi- 
atric reason): 550; for induction stations 
and service commands, 551-552, 660; 
for NSA scores, 561, 562; for NSA 
scores and critical signs, 562, 563; and 
NSA screening rate, 476, 558, 560-561; 
and psychiatric discharge rate, 507; 
range among induction stations, 475, 
550; variability among regions of U.S., 
561; variability between regions of 
U.8., 552, 554 


Psychiatric rejection rate (‘‘others’’) (% of 


preinduction examinees rejected as 
“other’’ psychiatric cases), 550 


Psychiatric rejects (those rejected for any 


psychiatric reason): dominant tendency 
for classification of, 550; and NSA 
scores and critical signs, 566; psychiatric 
diagnoses of, 553-554; screened by 
NSA, 475, 508, 510, 558, 565; screened 
by NSA for induction stations and serv- 
ice commands, 556-557, 559, 560 


Psychiatrie rejects (‘‘others”): and NSA 


scores and critical signs, 566; screened 
by NSA, 475, 558; screened by NSA for 
induction stations and service com- 
mands, 556-557; variability of, 558 


Psychiatric screening: see also “Screening”; 


and critical signs, 511; effect of errors 
in, 489; and inclusive screening, 488, 
489, 555; and induction examination, 
488, 508, 555; and malingering, 489; and 
screening of psychoneurotics, 502, 511 


Psychiatric screening tests: and areas of 


discrimination, 479, 492-493; design of 
initial study, 489-492; predictability of, 
567; and psychosomatic complaints 
area, 499; to replace psychiatrists, 489 


Psychiatric selection, in Army, 510 
Psychiatric test pretest: 489-501, 503-504; 


areas covered in, 492-493; criterion 
group for, 490; derivation of one area to 
represent all 15 areas, 494-499; and 
effect of anonymity, 504, 505; at induc- 
tion station, 508-511, 549; lists of items 
for, 512-538; sampling for, 491; time of, 
491 


Psychiatrists: 475; availability of, 475, 


487, 488, 548, 567; behavior of at induc- 
tion stations, 474, 477, 488, 489, 507n, 
508, 512; and critical signs, 511; and 
diagnosis and disposition of cases, 549- 
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555, 561; and peychiatric testa, 489, 508, 
548-549 

Psychological description and generaliza- 
tion, inadequacy of rank order for, 65 

Psychological measurement, 4n 

Psychoneurotic diagnosis and psychoneu- 
rotic rejection, 507n 

Psychoneurotic diagnosis rate (% of total 
psychiatric rejects): 553; for induction 
stations and service commands, 553- 
554; for NSA scores and critical signs, 
564, 565; range among induction sta- 
tions, 475, 550; variability within re- 
gions of U.S., 475 

Psychoneurotic inventory, see ‘“Psychoso- 
matic complaints” 

Psychoneurotic patients: in psychiatric 
test pretest: 489; and Army croes sec- 
tion, 492, 493, 494-497, 499-500, 538n; 
correspondence to inductees, 502, 505- 
511; distribution of background charac- 
teristics, 512-514; distribution of re- 
sponses to items of psychiatric import, 
514-538; as ‘“‘ineffectives,’’ 490; peycho- 
somatic complaints scores of, 501, 547; 
sampling of, 491, 512; size of sample of, 
495, 512 

and sampling bias, 713n 

Psychoneurotic rejection, and ability to 
perform a day’s work, 490, 507n 

Psychoneurotic rejection rate (% of pre- 
induction examinees rejected on psycho- 
neurotic grounds), 550 

Psychoneurotic rejects (those rejected for 
psychoneurotic reasons): and NSA 
scores and critical signs, 566; and psy- 
chopaths, 559; screened by NSA, 475, 
508, 558; screened by NSA for induction 
stations and service commands, 556-557 

Psychoneurotic screening, see under 
“Screening” 

Psychoneurotic symptoms, see ““Psychoso- 
matic complaints” 

Psychoneurotics, acceptability of in Army 
life, 490, 507n 

Psychoneurotics to be discharged, as cri- 
terion group, 491-492 

Psychoneurosis in Army: and background 
characteristics, see separate character- 
istics; as basis for discharge or rejection, 
490; and combat duty, 490n; and effec- 
tiveness, 490; and psychosomatic com- 
plaints scores, 502, 506 

Psychopathic diagnosis rate (% of total 
psychiatric rejects): 553; for induction 
stations and service commands, 553- 
554; for NSA scores and critical signs, 
564, 565; range among induction sta- 
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tions, 475; variability within regions of 
U.8., 475, 554-555 

Psychopathic rejection rate (% of pre- 
induction examinees rejected as psycho- 
paths), 550 

Psychopathic rejects (those rejected as 
psychopaths): and NSA scores and criti- 
cal signs, 566; and psychoneurotics, 559; 
screened by NSA, 475, 558; screened by 
NSA for induction statiors and service 
commands, 556-557 

Psychophysical methods, 4n, 12n 

Psychosomatic complaining, varying level 
of: in civilian vs. military life, 502-503, 
506, 508-511; in signed vs. anonymous 
questionnaires, 504, 505 

Psychosomatic complaints: 441, 535-538; 
ace also ‘‘Neuropsychiatric Screening 
Adjunct”’; correlation of with criterion, 
497; defined, 493; discriminating value 
of, 479, 494-495, 501; joint occurrences 
of, 540-543 

latent distance analysis of: 31, 441~ 
446; computed latent structure for, 31, 
444; fitted response pattern frequencies 
of, 445; manifest data for, 445; total 
misfit of, 457 

list of items for, 31, 165-166, 441, 535- 
538; and multiple regression weights, 
498; and NSA, 511; and otber areas, 
496, 497; as quasi scale, 15, 31, 162, 479, 
494, 499, 547; scalogram of, 164, 458; 
scalogram analysis of, 162, 457; scores 
for normals and neurotics, 501; scores 
for inductees and rejects, 509; as substi- 
tute for all 15 areas of psychiatric im- 
port, 479, 499; and varying weights, 
499-501, 547 

Psychosomatic complaints score: adapta- 
tion of to induction station screening, 
502-512; and anonymity, 504, 505; and 
Army life, 508-510; and context, 503- 
504; distribution of, 501, 509: further 
refinements of, 499-501; and hospital- 
ization, etc., 505-506; as substitute for 
score on all 15 areas of psychiatric im- 
port, 499; test-retest reliability of, 503; 
weighting of, 547 

Psychotic diagnosis rate (% of total psy- 
chiatric rejects): 553; for induction sta- 
tions and service commands, 553-554; 
for NSA scores and critical signs, 564 

Psychotic rejection rate (% of preinduc- 
tion examinees rejected as psychotics), 
550 

Psychotic rejects (those rejected as psy- 
chotics): and NSA scores and critical 
signs, 568; screened by NSA, 475, 558; 
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screened by NSA for induction stations 
and service commands, 556-557 

Public Law 16, 606 

Public Law 346, see “GI bill’’ 

Public opinion polling, 18, 42, 43, 48, 49, 
57, 58, 60, 61, 63, 68, 72, 87, 88, 150-151, 
154, 155, 156, 174, 177, 180, 187, 214, 
215, 266, 296 474, 484, 676 

“Public” response of Negroes to white in- 
terviewers, 721n 

Puerto Rican troops, 720n 

Punishment, see ‘‘Learning theory” 

Pure test: 366-369, 370, 377, 394, 408, 422; 
defined, 367, 368, 369 

Purpose of research, 3, 45, 473 


Q 


Qualitative observations, basis for proper 
quantification of, 61, 313-314 

Qualitative and quantitative variables: 
192-193, 195, 199, 205, 308; and latent 
structure theory, 206, 209; and predic- 
tion and correlation, 193 

Qualitative variable, see “Attribute” 

“Quantifying data,’’ 193 

Quantitative variables, in latent structure 
theory, 472 

Quasi scales: 5, 6, 7, 15, 18, 19, 40, 41, 43, 
159-163, 205; analytic exposition of, 30, 
31, 207, 410-412; correlation of with 
outside criterion, 5, 15, 79, 494, 547; 
definition by Guttman, 79, 159, 160, 
494; examples of, 162, 493-494; impor- 
tance of, 162, 494; and intensity anal- 
ysis, 40, 218, 222, 249; and latent 
structure theory, 7, 30, 207, 211, 458; 
and perfect Guttman scale, 7, 30, 79; 
and quantitative variables, 208; and 
sample size, 283; stability of, 499-500; 
and scales, 5, 15, 159-160, 218, 234, 494 

Quasi-scale item, example of, 160-162 

Questions, see “Items”’ 

Questionnaire: and interviews, 581-583, 
676, 718-721; for postseparation per- 
formance, 706-708; for postwar plans, 
695-706; for psychoneurotic screening, 
512-538; and research, 53-54; as a sub- 
universe of behavior, 53, 54, 86-88 

Questionnaire administration: 718-721; 
and item wording, 709; and Negro in- 
terviewers, 720 

Questionnaire development, 478, 583-585, 
676-678 

Questionnaire length: and error, 610; and 
fold-over techniques, 41, 249, 252, 276 

Quintiles and attitude research, 296 

Quota sampling, see “Sampling, purpos- 
ive” 
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R 


Race: and free comments, 721~722; and 
nonresponse on surveys, 719 

Racial protest of Negroes, Negro vs. white 
interviewers, 720 

RAND Corporation, vii, 12, 13n, 413n, 
460 

Random errors, 79, 160, 161 

Random numbers and latent structure 
theory, 429 

Random (stratified) sampling: 54n, 279, 
280, 283, 711-714; and sampling of 
items, 286-287; and screening tests, 487 

Random variable, 193 

Range of marginal! frequencies, as criterion 
for scalability, 78, 79, 119, 287 

Rank of correlation matrix, 470 

Rank of enlisted men, distribution in U.8. 
and overseas, 716 

Rank of matrix of joint positive frequen- 
cies: 372, 373, 377, 393, 414; and degree 
of polynomial trace lines, 372, 373, 467; 
and number of latent classes, 393, 
467 

Rank of respondents: see “Ordering of re- 
spondents”; and items endorsed, see 
“Reproducibility,” ‘Score, as a value of 
a scale variable” 

Rank groups, number necessary for order- 
ing people, 8, 79, 81, 133, 219, 296 

Rank orders: distribution of, 81-82; of 
items, see “Ordering of items’; of re- 
spondents, see “Ordering of respond- 
ents’’; as scale variables, 65, 312, 324 

Ranking, see ‘‘Ordering”’ 

Realism of postwar plans, 569-570, 571, 
576, 579, 582, 621, 646, 651, 654, 677, 
682, 689, 690, 691 

Rebelliousness and neurosis, 532, 533 

Reducibility: 381-384, 393-396, 417; con- 
ditions of, which see; example of, for 
latent dichotomy, 383; number of inde- 
pendent conditions for, 393; from three 
to two latent classes, 396 

Reemployment, postseparation, as guar- 
anteed, 571, 623-624 

Reenlistment in Army vs. desire for an 
Army career, 139 

Region of indifference: 34; see also “Zone 
of relative indifference’”’ 

Regional variations: in induction rate, 
560; in psychiatric rejection rate, 560, 
561; in NSA screening rate, 55s, 560, 
561 

Regression: of criterion on composite, 55- 
56; in factor analysis, 471; linear, 192, 
208, 361; multiple, 56, 361, 497, 498, 
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500, 538, 547; multiple, and the dis- 
criminant function, 495n - 

Regression equations, 173, 539 

Regression medians of intensity on con- 
tent, 219 

Regular Army, desire for career in, 133, 
135, 137, 139 

Rejection for psychiatric grounds, see un- 
der “Psychiatric” 

Relative marginals, 381 

Relativity of scales, 82, 83, 89-90, 168- 
170, 239, 243, 249 

Release from Army, see “‘Army Score Card 
Plan” 

Reliability: in latent structure analysis: 
25, 27, 452; and broad groupings of re- 
sponse patterns, 27; and missing fre- 
quencies, 394 

of plans classification, 579; of plans 
for education, 592, 602-603; of postwar 
plans, 635, 689; of psychosomatic com- 
plaints score, 503 

in scalogram analysis: 10, 18, 8in, 
277-311; see also ‘Coefficient of relia- 
bility”; of intensity function, 301-302; 
of marginal frequencies, 283; of part 
scores and whole scores, 296-299; and 
reproducibility, 305-308; of scale ranks, 
295-296; and scale scores, 308-311; and 
size of sample, 299-301 

Repression and prediction, 483-484, 690 

Reproducibility: 9, 14, 15, 16n, 17, 62, 66, 
70, 74, 77-80, 91, 110, 111, 118, 122, 
154, 159, 174, 179, 182-184, 186, 189- 
191, 198, 199, 201, 204, 205, 213, 218, 
252n, 278, 283, 286, 307, 311, 333, 456, 
493n, 494; see also ‘‘Coefficient of repro- 
ducibility”; of correlation coefficients, 
205; and fit, 456; improvement of, in 
combination of categories, 111-115; and 
precision, 456; from principal compo- 
nents, 331-333, 359-360; and reliability, 
305-308; of single item, 287, 291 

Research Branch and local commanders, 
711 

Research, postwar, vi, vii, 19, 722 

Resentment and neurosis, 532 

Residence, distribution of for normals and 
neurotics, 513, 525 

Resolution into principal components, 322 

Respondent types, 334-335 

Response categories, see ‘“Categories”’ 

Response frequencies: see also ‘Marginal 
frequencies”; for categories, 334, 336; 
for respondents, 334, 335-336 

Response patterns: 21-27, 31, 363, 369, 
371, 382-383; ambiguous, 445-446; dis- 
persion of, 465 

fitted: computation of, 427-429, 439, 
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445; for latent dichotomy, 22, 23, 25, 28, 
414, 428; for latent distance scale, 445; 
for restricted latent trichotomy, 440 

and joint positive frequencies, 381, 
418; for latent structure of perfect Gutt- 
man scale, 456, 457; number of, for 
dichotomies, 381; ordering of, which see; 
and principal components, 34; and trace 
line scores, 460-465; and weights as- 
signed by judges, 186 

Response types in scalogram analysis: 100; 
see also ‘‘ ‘Positive’ responses,” “‘ ‘Nega- 
tive’ responses,” etc. 

Responses: correct. by guessing, 67n; as 
functions of principal components, 330— 
332, 359-360; inconsistent, 60; nondi- 
rective, 152n (see also “‘Items, write-in’) 

Restricted latent trichotomy: analytic ex- 
planation of, 32, 401-408; computed 
latent structure of, 438; conditions of 
reducibility for, 404-405, 407n; cross- 
product matrix for, 404; empirical ex- 
ample of, 432-441; and latent distance 
scales, 447; and latent linearity, 446; 
latent structure of, 403, 434; and order- 
ing of latent classes, 446; and polyno- 
mia] trace lines, 32, 33, 402; possible 
patterns of latent marginals for, 32; 
terminology of, 403, 433n; and trace 
lines, 32 

Restricted quadratic case, see “Restricted 
latent trichotomy” 

Rewards, see “Learning theory” 

Richardson, M. W., 503 

Riflemen, and combat fears, 13 

Riker, B. L., 47n 

Robinson, W. S., 219n 

Rorschach tests, 479 

Rossi, Peter, 413n 

Russia, attitude toward, see ‘Attitude 
toward Russia” 


S as computational magnitude: 374-375, 
399-465; see ‘“‘S-value”’ 
S as system, 380-393 
Sample selection: 709-718; adequacy of, 
715-718 
Sample size: in item analysis, 182n; in 
NSA study (August 1945), 551; in post~ 
war plans studies, 582, (summer 1944) 
597; and prediction, 283; in psychiatric 
test pretest, 512 
in sealogram analysis: 77, 100; in- 
crease of to provide stability, 78, 114, 
133, 283, 300; and pretest, 283; and 
quasi scales, 283 
and zero point, 283, 302 
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Sampling: of behavior, 52-54; and bias, 
714, 717 
of items: 54, 59, 77, 80-82, 87-89, 100, 
158, 162, 163, 166, 172, 177-178, 179- 
181, 214, 215, 275, 277, 278, 286-302, 
313; definition of process, 82n, 286; and 
equations of scale analysis, 313; and 
errors of observation, 302; and inten- 
sity analysis, 216, 218, 249, 275, 301- 
302; in nonscalable areas, 177, 179, 180, 
300, 333; and principal components, 
326-327; and random sampling, 286- 
287, 327; and reliability, 295-302, 307, 
310 
of organisations, see “Sampling, pur- 
posive”’ 
of people: 54n, 77, 78, 82, 114, 195, 
277-283, 710-714; for groups to be com- 
pared, 492; and hypothesis of approxi- 
mate perfection, 278-279; and intensity 
analysis, 301-302; method used in Army 
surveys, 711-714; in pretest to psychi- 
atric screening test, 491-492; in scalo- 
gram analysis, 100 
purposive, 710, 713, 714-715; of 
trials, see ‘‘Test-retest reliability”; of 
triplets of items, 425 
Sampling design: 709-718, 722; and atti- 
tude data, 710; and larger U.S. posts, 
715; and local commanders, 711-714 
Sampling error: 25, 72, 77, 195, 219, 279, 
283, 498, 538, 547; and absentees, 713n; 
and item wording, 709, 722; for latent 
structure theory, 414, 422, 429, 437; of 
marginal frequencies, 283; of sample 
median intensities, 302 
Sampling theory: 709; of scale error, 279- 
280, 302 
Sampling variability: 25, 42, 43, 219, 279, 
280; for Research Branch sample, 716~- 
717 
Satisfaction with one’s Army job: 163, 
239, 243, 433; content scores vs. inten- 
sity scores, 242, 246, 247, 248 
intensity analysis of: 239, 242; by 
fold-over technique, 253; among non- 
coms, 243, 247; among privates, 243, 
246, (high school graduates) 243, 248 
list of items for, 125, 291, 433, 529- 
530; among Negroes, for Negro and 
white interviewers, 720 
restricted latent trichotomy analysis 
of: 482-441; computed latent structure 
for, 438; empirical cross-product matrix 
of, 433; fitted response pattern frequen- 
cies for, 440; manifest data for, 440; 
precision of, 441 
scalogram of, 124; scalogram analysis 
of, 123 


Savage, L. J., 371n, 401n, 466n 

Savings: and postseparation business own- 
ership, 662; and postwar plans for busi- 
ness ownership, 655, 656 

Scalability: 4-20, 30, 34, 39, 45, 48, 49, 
51, 52n, 62, 65, 71, 78-85, 88, 89, 91, 
98, 99, 117, 143, 150, 153, 163, 166, 167, 
169n, 175, 177, 179, 180, 182, 184, 188, 
190, 192, 204, 215, 270, 275, 277, 278, 
283, 288, 291, 293, 294, 300, 308, 334, 
686; consequences of, 18; criteria of, see 
“Criteria of scalogram analysis’; max- 
imization of, 16, 112~113; occurrence 
of, 157-159; and principal components, 
332-333; as a problem for attitude stu- 
dents and public opinion pollera, 48-49; 
of subsets, 84, 115; and test-retest re- 
liability, 305, 311 

Scale: see also ‘Latent structure theory,” 
“Perfect Guttman scale,” ‘Scales,” 
“Scalogram analysis’; applicability of, 
172, 173, 479; as defined by Guttman, 
5, 9, 62, 63, 64, 74, 76, 77, 88, 88, 91, 
143, 159, 174, 181, 361, 493, 500, 686; 
of dichotomies, example of, 10, 65-74, 
202; as distinct from universe, 84; of 
intensity, see ‘Intensity scales’’; latent, 
see “Latent distance scale’; and non- 
scale approaches, 172-174; perfect, see 
“Perfect Guttman scale’; as sample 
from universe, 172, 173; of stature, 10; 
of trichotomies, example of, 74-77; util 
ity of, 143, 153-157 

Scale analysis: 46n; see ‘‘Scalogram anal- 
ysis” 

Scale error, see “Error,” “Errors of repro- 
ducibility” 

Scale matrix, analysis of, 334-357 

Scale order: 64; see “Ordering of respond- 
ents in scalogram analysis” 

Scale pattern, see ““Parallelogram pattern” 

Scale rank: 143; see “Score, as a value of 
a scale variable” 

Scale score: 64; see “‘Score”’ 

Scale theory, 3-45, 381 

Scale type: assignment of respondent to, 
106n, 115, 119, 120, 595, 610; defined, 
in scalogram analysis, 106n, 290; as 
function of cutting points, 115; and pre- 
diction, 283; and sampling error, 283; 
and scoring population, 119, 120; and 
zero point, 283 

Scale type reduction, in perfect scale, 75, 
144 


Scale types: distribution of, 327; number 
of (defined), 75, 8in, 114, 115, 317 

Scale variable, 64, 65 

Scales: and contingencies, 685-686; and 
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equal-appearing intervals, 186; and 
item analysis, 8, 181-185; and latent 
structure analysis, 207, 211, 446; non- 
existence of, in universe, 82n, 83, 89, 
168; and problems in description, 153— 
154; and problems in interpretation, 
154-155; and problems in prediction, 
155-156, 173-174, 178-181, 199, 479- 
480, 684-688; and quasi scales, 5, 15, 
159-160, 218; relativity of, which see 

Scaling, 3-45, 446, 479 

Scalogram: see ‘‘Scalogram analysis”; bar 
chart representation of, 69, 73, 76, 290; 
hypothetical examples of, 10, 35, 67, 
116, 289, 290, 317; key to those pictured 
in this volume, 103; matrix of, 333, 334 

Scalogram on: attitude toward the Army: 
initial arrangement, 104; ranking of re- 
spondents, 107; combining categories, 
109; final arrangement, 118 . 

attitude toward an Army career, 136; 

attitude toward combat, 148; attitude 
toward officers, 126, 284, 285; attitude 
toward postwar conscription, 134; atti- 
tude toward ‘‘sweating out a jump,” 
146; attitude toward the WAC, 130; 
fear symptoms, 140; generalized verbal 
intensity, 258; independent dichotomies, 
289, 290; independent trichotomies, 294, 
295 


intensity of: attitude toward the 
Army, 228; attitude toward officers, 
224; knowledge of current events, 232 
knowledge of current events, 138; 
psychosomatic complaints, 164, 458; 
satisfaction with one’s Army job, 124 
Scalogram analysis: 5-7, 9-19, 34, 35, 39, 
46, 47, 51, 52, 60-212, 277-311; advan- 
tages of, 7, 39, 46, 61, 71n, 121, 141, 143, 
144, 151, 153-156, 171, 172, 174, 204, 
301, 455; and amount of work required, 
121; and arbitrary indexes, 174-181; 
and arbitrary scoring, 176n; basic steps 
of, 98-100; basis for, 60-90; and be- 
havior of judges, 186; bibliography of, 
90; and “‘cafeteria” items, 187-189; and 
combination of categories, 99-100, 108— 
115, 293; and content, 85, 119, 129, 167; 
criticism of, 133, 158; and the definition 
of the problem, 144-160; and determina- 
tion of characterizing items, 167; and 
differences in kind, see ““Nonscale types” ; 
and equal-appearing intervals, 185-187 
equations of: 312-361; description 
and illustration of, 313-333; and inter- 
nal consistency, 315-316; and latent 
structure, 455; mathematical analysis, 
334-361 
example of, 98—121; examples of items 
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used in, 10, 13; examples of utility of, 
145-150; and factor analysis, 6-7, 61, 
191-212, 330-331; final arrangement of, 
100, 115, 118; and the halo effect, 167- 
168; initial arrangement of, 98-99, 100— 
105; and intensity analysis, 213-276; 
and internal consistency, 181, 182; and 
item analysis, 181-185; items used in, 
10, 11, 13, 460; and item wording, 150— 
153; and latent structure theory, 7, 20, 
33, 45, 210-211, 455-460; and the meas- 
urement of intensity, 40-43, 156-157, 
213-276; and nonscale items, 288-295; 
and ordering of respondents, 99, 105- 
108; and paired comparisons, 189-191; 
and perfect scale types, 116; and post- 
war plans for education, 593-595, 598— 
599; and prediction, 595, 684; and prin- 
cipal components, 33-44, 312-361; and 
problems of interpretation, 151-152; 
and the problem of metric, 65, 312- 
313; and problems of reliability, 277- 
311; and problems of response categories, 
152, 153; and psychiatric diagnosis, 162, 
478, 493-494; and quasi scales, 162-163; 
and relation to other techniques, 61, 
172-212; and the relativity of scales, 
168-170; and reproducibility as a lower 
bound to reliability, 305-308; and sam- 
pling the universe of items, 163, 166, 
177, 286-302; as a special case of latent 
structure theory, 30, 31, 207, 210, 455; 
and the split-half lower bound to relia- 
bility coefficient for scale scores, 309- 
311; stability of, 494n (see also ‘‘Order- 
ing of respondents in scalogram analysis, 
constancy of for both sample and uni- 
verse”); techniques for, 91; theory of, 
61-65; and trace lines, 458-460; and 
trend analysis, 170; and trichotomized 
items, 294; utility of, 39, 88, 122-171, 
173, 177, 204, 331, 360, 493, 595 

Scalogram board: 92-97; construction of, 
93, 95, 96; diagrams of, 95; and inten- 
sity analysis, 221; photographs of, 92, 
94; and transference of balls, 97, 99, 100, 
108, 115; and use of first board, 97, 98, 
99, 100, 102~108, 115; and use of one 
board, 120, 121; and use of second 
board, 97, 99, 100, 108-115; and use of 
two boards, 96-99; utility of, 143-153; 
and utility for other purposes, 93 

Scalogram board technique of scale anal- 
ysis: 70, 88-89, 91-142; amount of work 
in, 121 

Scalogram items, relationship of with 
other models, 11-13, 460 

Scalogram picture, 290, 295 

Scattergram relating intensity to content: 
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301; see also “Intensity analysis, content 
scores vs. intensity scores” 

Schmeidler, G. R., 47n 

School, see “Education” 

School adjustment, see “‘Childhood school 
adjustment” 

School attendance prior to Army service, 
571, 587 

Score Card Plan, see “Army Score Card 
Plan” 

Score, reproducibility of in perfect scale, 
62, 66, 91, 141, 154, 174, 175, 181, 199, 
279, 493n 

Score, as a value of a scale variable, 64, 
66, 69, 72, 74, 76, 77, 89, 91, 154, 156, 
179, 184, 221, 279, 286, 305, 325, 481, 
547, 686 

Scores: critical, which see; in equal-appear- 
ing intervals, and reproducibility of re- 
sponses, 186; which “go in a circle,” 36, 
316, 318, 319-320, 321 (see also “Prin- 
cipal components”); in item analysis, 
183-184; noncritical, which see; part- 
whole reliability of, 296-298 

for persons, in scalograms illustrating 
principal components: 35-38, 212, 314— 
316, 318, 337-338, 360; correlation be- 
tween, 329-330; independence from 
weights, 314-315, 327, 336; oscillation 
of, 351-356; required condition for, 35, 
314-315; and weights, 315-316, 327~ 
328, 336, 339, 340 

of response patterns, 460-465, 468; 
for each scalable subuniverse, 175, 494; 
of single items, 465; split-half reliability 
of, 298-299, 310; test-retest reliability 
of, 308-311 

Scoring diagram for dichotomous scale, 
120 

Scoring of population, as determined by 
scalogram analysis: by nearest ideal 
scale type, 119, 120, 595, 610, 612; by 
simple weights, 119, 120, 458 

Screening: 486-489, 574; and changing 
situation, 505-511; and cheating, 488- 
489; defined, 486; exclusive and inclu- 
sive, 487-489; of “‘other’’ psychiatric 
cases, 558, 565; of psychiatric cases, 476, 
559 

of psychoneurotics: 15, 474-477, 478— 
479, 486-567; comparison of psychiatric 
diagnoses and test scores at all induction 
stations, 548-567; and original battery 
of tests, 492-499; original item lists for, 
512-538; and other psychiatric disor- 
ders, 502, 511, 559; and problem areas, 
479, 492-493; and proportion screened, 
487; and scalogram analysis, 478; tech- 
nicaldevelopment of teats, 486-547 
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of psychopaths, 502, 511, 558, 565; of 
psychotics, 502, 511; and vocational 
testa, 487 
Second component, see “Intensity compo- 
nent” 
Security, desire for: and government job, 
687, 668; and time of separation, 639 
Segmental probabilities: 377; see “Point 
distribution” 

Segmentalization, see ‘Point distribution” 

Segregation of Negroes on bus lines, 722 

Selective Service, 569 

Selective Service Act, 571, 623 

Self-confidence, see ‘Personal adjustment” 

Self-confidence and neurosis, 535 

Self-employment: see also ‘Postwar plans 
for business ownership”; defined, 646, 
647n; rates by age in U.S., projected to 
Army population, 656, 666 

Self-prediction, 573, 574, 680, 691 

Semi-nondirective items, see 
write-in” 

Semiskilled workers, plans of to return to 
same type of work, 638 

Sensory-motor measurements for screen- 
ing psychoneurotics, 487n 

Service commands: and disposition of pre- 
induction examinees, 551-552, 560; and 
NSA screening rate for inductees and 
rejects, 556-557, 560; and psychiatric 
diagnoses of psychiatric rejects, 553-554 

Service educational programs, 478 

Servicemen’s Readjustment Act of 1944, 
see “GI Bill” 

Shapiro, Gilbert, 90, 157n 

Sheffield, Fred, 96n 

Sherif, Muzafer, 4n, 47n 

Shot, 118 calibre, use of, 93 

Siblings, distribution for normals and neu- 
rotics, 513 

Sick call and psychiatric diagnosis, 490n 

Signature: for cross products, 398; for 
matrix, 386-391, 393 

Signing names and claiming psychosomatic 
symptoms, 504, 505 

Simple function, 64, 66, 69-72, 88, 91, 113, 
152, 154, 279, 296, 208-300, 331 

Single factor, in scalogram analysis and in 
factor analysis, 201-203, 204 

Single factor theory: 191, 203-204; and 
areas of psychiatric import, 498; and 
correlation between variables, 203; 
and quasi scales, 208; and scalogram 
analysis, 192, 206, 330; and unique 
factors, 204-205 

Single item: bias of, 48, 49, 87, 169; to 
determine both content and intensity, 
249, 252-253, 254-255, 256, 257; and 
generalised verbal intensity, 262-266; 


“Ttems, 
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nonindependence of content and inten- 
sity for, 252n; to represent area, 18, 87, 
123, 151, 215, 262 

Single trial, to estimate average reliability, 
304, 309 

“Size of circle” for principal components: 
316, 320, 321, 328; for constant compo- 
nent, 322-323; for infinite case, 325n; 
and oscillations, 353-357; and standard 
deviations, 324n; total, 322, 325n 

Skewness of distribution function for linear 
trace lines, 465 

Skilled workers, plans of to return to same 
type of work, 638 

Slats, removable, in scalogram board: 93; 
construction of, 95; horisontal, 97, 103; 
vertical, 97, 108 

Sletto, R. F., 85n 

Smaller War Plant Corporation, 569 

Sociability: 526-527; correlation of with 
criterion, 497; defined, 493, 526; dis- 
criminating value of, 479, 495, 526; list 
of items, 527; and multiple regression 
weights, 498; and other areas, 496, 497; 
as scale, 493; scores for normals and 
neurotics, 526 

Social change and prediction, 579 

Social classes, 376 

Social distance scale, 13, 86, 410 

Social planning, and prediction, 675 

Social Science Research Council, vi, 4, 722 

Social status, appraisal of, 61 

Social stratification and point distribu- 
tion, 376 

Soldier role, see under “Acceptance” 

“Solid streaks,’’ in scalogram analysis: 
105, 106, 108; relation of to cutting 
points, 110 

Southern states of U.S.: and NSA scores, 
558, 561; and variations in psychiatric 
diagnoses, 475; and variations in psychi- 
atric rejection rate, 554, 561 

Spanish language, 720n 

Spearman, C. E., 4, 6, 19, 163, 191-192, 
197, 201, 202, 203, 204, 206, 207, 208, 
209, 210, 211, 300, 311, 330, 422n 

Spearman split-half formula and split-half 
lower bound, 311 

Spearman-Brown prophecy formula and 
approximate scale, 300 

Specificity of public opinion, as contrasted 
to attitude, 87 

Split-half formula (Spearman’s), see 
“Spearman split-half formula” 

Split-half lower bound to reliability coefii- 
cient: 309-311; formula for, 310; and 
Spearman split-half formula, 311 

Split-half reliability, 298-299, 310 
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Sports, participation in, see ‘‘Participation 
in sports” 

Stability, see “Mobility” 

Stability of postwar plans, 579, 581, 689 

Stammering and neurosis, 520 

Standard deviation: 193; of latent distri- 
bution, 446; and qualitative variates, 
194; and quantitative variates, 195 

Standard error of sample coefficient of re- 
producibility, 77 

Standard error formulas developed, 709 

Standard Oil Company of New Jersey, 
638n 

Standards of psychiatric diagnoses and re- 
jection, varying, 475-477, 550, 552, 554— 
555, 561, 565, 567 

Star, Shirley A., vii, 474, 486n, 548n, 568n, 
709n 

Station hospitals, sampling of for psycho- 
neurotic criterion group, 491-492 

Statistical variable, 193 

Stereotype, 152, 670 

Stochastic variable, 193 

Stomach trouble: and neurosis, 537; and 
other psychosomatic complaints, 543 

Stomach ulcers as a critical sign, 511n 

Stouffer, 8. A.: vii, 3n, 362n, 458, 459, 
473n, 709n, 710 

ee by items, 386, 391-392, 398, 


Stratified sampling, 710, 711, 715 

Strength of feeling, 249, 276 

Studies of postwar plans: 568-708; for 
business ownership, 645-667; for educa- 
tion, 587-622; for farming, 639-645; for 
governmental employment, 667-675; 
and problems of prediction, 568-581, 
675-695; and procedures used, 581-587; 
questionnaires used, 695-708; for return 
to previous employer, 623-639 

Sturm, J. C. F., 357 

Sturm-Liouville boundary value problem, 
357 

Stuttering and neurosis, 520 

Subcategories, see ‘‘Categories’”’ 

Subclasses of population: 376; see also 
“Latent classes” 

Subsamples of items, 297 

Subuniverses of attitudes, 52, 53, 83, 84, 
87, 129, 144, 145, 150, 158, 163, 166, 
169, 173, 175, 176n, 177, 192, 243 

Suchman, E. A., 3n, 40, 90, 91n, 122n, 
213n, 568n 

Suggestibility of soldiers: 571, 583-585, 
593, 594, 669, 671, 679-681, 687 

Surgeon General’s Office, 474, 486n, 507, 
549 

S-value: adjusted (fitted): for latent di- 
chotomy (four or more items), 420; for 
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latent distance scaJe, 442; for restricted 
latent trichotomy, 437 
computed from latent marginals, 461; 
computation of, 419, 423, 435, 436, 437, 
443 
defined: for latent dichotomy (three 
items), 415; for integral marginal fre- 
quencies, 399, 415; for proportions, 374; 
for restricted latent trichotomy, 404 
and expected position of respondent, 
465 
“Sweating out a jump,” attitudes toward, 
see under ‘‘Attitudes toward” 
Szegd, Gabor, 357n 


T 


Tabulation technique of scale analysis, 91, 
143, 144 

Tactlessness and neurosis, 531, 533 

Taylor, C. C., 640n 

Taylor's series, 371 

Tchebychef, P. L., 357, 358 

Tennis in boyhood and neurosis, 523 

“Tentative” postwar plans for education, 
defined, 599 

Test construction, 454-472 

Test development, for screening psycho- 
neurotics, 486-547 

Test item: 363; see “Items” 

Test and pretest, differences in adminis- 
tration of: 502-512; effect of anonymity, 
504, 505; effect of change of criterion, 
507-508; effect of change in situation, 
505-507, 508, 510; effect of context, 
503-504 

Test, pure, see ‘Pure test” 

Test situation, 483-484, 690-693 

Testing, defined, 363 

Test-retest reliability: 10, 25, 277, 278, 
302-311, 467n; and average reliability 
of population, 304, 309; and marginal 
lower bound, 304-305; of psychosomatic 
complaints score, 503; of qualitative 
data, 302-308; and reproducibility as a 
lower bound, 305-308; and scalability, 
305, 311; of scale scores, 308-311; and 
Spearman split-half formula, 311; and 
upper bound, 307 

Tetrachoric coefficients: 71n, 195-199, 
201; and factor analysis, 200-201; and 
perfect Guttman scale, 198; and quali- 
tative data, 195, 197, 200 

Tetrads of cross products: in fully degen- 
erate latent trichotomy, 396; in latent 
dichotomy, 394, 421; in latent distance 
scale, 442, 455; in restricted latent tri- 
chotomy, 433-434 

Tetrads of point intercorrelations, 202 
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Third component, see “Closure compo- 
nent” 

Third Congress of the World Association 
of Public Opinion Research, 313n 

Thorndike, E. L., 168n 

Thurstone, L. L., 4, 6, 12n, 19, 32, 86, 
167, 185, 191, 192, 197, 201, 203, 204, 
206, 207, 209, 210, 211, 313n, 330, 362, 
422n, 447, 459, 460, 469, 472, 498 

Thurstone scale, see ‘““Equal-appearing in- 
tervals” 

Trace lines: 7, 32, 364-379, 395, 402, 414, 
460; “breaks” in, 364, 365, 411, 442; 
examples of, 365, 463; of Guttman scale, 
455-456, 458; and latent linearity, 446- 
447; linear, which see; number of inde- 
pendent, 371n; parabolic, which see; and 
point distributions, 377-379, 466-469; 
polynomial, which see; and population 
distribution, 370, 374; and pure tests, 
368; of response patterns, 460-465; and 
scalogram analysis, 458-460; shape of, 
364-366, 414, 432, 446, 459; and tre 
chotomies, 472; unimodal, 447; and 
wording of items, 459, 467 

Trace line scores: of response patterns, 
460-465; of single items, 465 

Trace surface, 453-454 

a linear, of scale variables, 

4, 65 

Transposed matrix, 388 

Trembling: and neurosis, 535; and other 
psychosomatic complaints, 540 

Trend analysis: 170; see also ‘Latent turn- 
over analysis” 

Trichotomization of items, 17, 119, 132 

Trichotomization of population: 300-301; 
by use of single items, 132, 151 

Trichotomous items, independent, exam- 
ple of: 293-295; scalogram analysis of, 
294, 295 

Trichotomous weights, for psychosomatic 
complaints score: differential, 500, 501, 
538-547 ; simple, 500, 501, 505, 509, 547; 
simple, and multiple regression, 546, 7 

Triplets of items: 398, 416; and restricted 
latent trichotomy, 434; sampling of, 425 

t-test, 194 

t-value: and K-value, 426; for latent d+ 
chotomy, 400-401, 415, 425-427, 449; 
for restricted latent trichotomy, 406, 
435, 436-437 

Two latent dichotomies, see ‘Latent tur- 
over analysis” 


U curve: see “Intensity component, shape 
of curve” 
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Ultimate frequencies, see “Response pat- 
terns, fitted” 

“Unbiased” items, 214-215 

Unconscious elements and predictions, 
483-484, 690 

Undecided respondents: ambiguity of, 
681; characteristics of, 579, 694; and 
plans classification, 578-579 

“Undecided’”’ responses: in intensity anal- 
ysis, 253; treatment of in latent struc- 
ture analysis, 402-403, 432-433; treat- 
ment of in scalogram analysis, 153 

Unemployment: postwar: 482, 577, 601, 
638, 693; in January 1946, 638; predic- 
tions of, 478, 571, 606 

prewar, 571 

Unidimensionality, 9, 26, 45, 46, 49, 55, 
60, 61, 63, 72, 77, 81, 88, 135, 142, 152, 
154, 155, 168, 181, 189, 200, 201, 213, 
216, 221, 275, 300, 331, 363, 366, 367, 
370, 422, 453, 479, 684, 685 

Uniform format, need for, 15 

Unimodal trace lines and latent linearity, 
447 

Unique factors, in single factor theory, 
204-205 

Universal military training, see “Attitude 
toward postwar conscription”’ 

Universe: as distinct from scale, 84; em- 
pirically defined, 83-85; nonscalable, 
173; “operational,” 166; structurally 
defined, 80n 

Universe of attributes, 51-53, 64, 65, 80— 
88, 158, 172, 174, 214, 218, 239, 275, 
286, 456 

Unskilled workers, plans of to return to 
same type of work, 638 

US. Census of 1940, 646, 647n, 656, 666 

U.S. Office of Education, 569 


Vv 


Validity: external, as problem of predic- 
tion, 57-59; internal, as problem of defi- 
nition, 57-58, 139, 150-151; relation be- 
tween internal and external, 58, 139, 
141; as used by Guilford, 184n (see 
“Scalability”’) 

Variability within the respondent, 302-311 

Variable, as a set of values, 63 

Variables: and their frequency functions, 
193; introduction of additional, as shown 
by failure of universe to continue to 
scale, 82; presence of additional, as 
shown by nonscale types in scalogram, 
78, 119; random, statistical, stochastic, 
193 

Variance: analysis of, 173, 174, 194; of dis- 
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tribution curve, 374; of scale scores 
among individuals, 310; of scale scores 
within individuals, 308-309; of scores 
for persons, 35, 39, 314, 337-338; of 
weights for items, 35, 39, 314-315, 338- 
339 

Variate, 193-194 

Veltfort, H. R., 47n 

Verbal habits, 16n, 42, 43, 111, 210, 253, 
264, 266, 484 

Veteran preference for government jobs, 
attitude toward, 158 

Veterans, plans of, see ‘Postwar plans” 

Veterans Administration, vii, 477, 569, 
578, 604, 605, 606, 622, 660n 

Vocational ability and criterion-exclusive 
screening, 487 

Vocational guidance, 478 

Voting as a qualitative act, 193 

v-values, defined, 404, 409 


W 


WAC: attitude toward, which see; and 
nurses, 132 
Wallin, Paul, 176n 
War, effect of on trends and relationships, 
576, 587, 666-667 
War aims, belief in, and neurosis, 528 
War Department, v, 478, 569, 602, 711 
War length: and education (summer 1945), 
608, 620; and new origin situation, 691; 
and postwar plans for education (sum- 
mer 1943), 589, 594; and postwar plans 
for education and educational level, 594, 
685; prediction of, and desire for com- 
bat, 692n; and return to previous em- 
ployer, 633 
War motivation and areas of psychiatric 
import, 493 
Washington Statistical Society, 483n, 689n 
Weights: assigned by judges, and behavior 
of respondents, 186; in equal-appearing 
intervals, 185-186; ‘which “go in @ cir- 
cle,’”’ 316, 320, 327, 328 (see also “‘Prin- 
cipal components”) 
for items, in scalograms illustrating 
principal components: 35, 36, 38, 212, 
314-316, 327-329, 338-339, 360; and 
average for each item, 318-319; correla- 
tion between, 330; independence from 
scores, 314-315, 327, 336; oscillation of, 
356-357 ; required condition for, 35, 314; 
and scores, 315-316, 318, 319, 327-328, 
336, 339, 340 
for multiple regression : 497-500, 538n, 
543; derivation of, 538-547; and scale 
scores, 500, 547 


756 


to obtain index, 174-175, 178; for 
order of demobilization, 189; for order- 
ing latent classes, 446, 468; from paired 
comparisons, 190-191 
for prediction purposes: 156, 178-179, 
499-500, 547; and equal-appearing in- 
tervals, 187; and item analysis, 182, 183; 
and nonscalable items, 156, 176, 178; 
and principal components, 361; and 
scalable items, 156, 175, 494 
in scalogram analysis: vs. ideal scale 
type method, 119-120; and initial order- 
ing of respondents, 105-106, 108; to 
score population, 119-120 
Welfare, attitude toward the Army’s con- 
cern with, 158 
Wherry, R. J., 470n 
Williams, F. D., 709n, 711n, 713n 
Williams, R. M., Jr., 486n 
Wittson, C. L., 486n 
Women in the Army, attitude toward, 132 
Women’s Army Corps, attitude toward, 
see “Attitude toward the WAC” 
Wording of items: 286, 625, 652n, 669, 709, 
722; and intensity analysis, 249; and 
latent structure analysis, 402, 414, 459; 
need for independence from, 34, 42, 80, 
214, 215; and scalogram analysis, 150— 
153, 158; and trace lines, 459, 467 
World Association of Public Opinion Re- 
search, 313n 
Worrying: 530-531; correlation of with 
criterion, 497; defined, 493, 530; dis- 
criminating value of, 479, 495, 530; list 
of items for, 530-531, 534; and multiple 
regression weights, 498; and other areas, 
496, 497; as scale, 493; scores for nor- 
mals and neurotics, 530 


XYZ 


z-value, in latent distance scale, defined, 
411 

Yardstick items, 459-460 

Yule, G. U., 6, 198n, 381n, 382n 
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Y-value: for latent dichotomy, 309 (see 
“S-value’’); for latent trichotomy, 399 

Zero: as mean rank of principal component 
scores, 318, 322, 340; as mean weight of 
each item, 318-319, 341, 345 

Zero cell, in fourfold table of scalable 
items, 71-72, 74, 198, 200, 202, 286 

Zero correlation in homogeneous system, 
382, 384 

Zero linear correlations between principal 
components, 39, 204, 329-330 

Zero-order correlations: 192, 196, 202; in 
factor analysis, 192, 202, 471; in Gram- 
ian matrix, 197, 201; between items 
of Guttman scale, 70-71, 76, 156, 1€2; 
as pedagogical device, 71 

Zero-order joint positive frequency, 381 

Zero-order weights for iteration, 539, 543, 
545 

Zero point: 5, 33-34, 41-43, 46, 73n, 102, 
151, 215-219, 221, 226, 243, 249, 253, 
312, 324, 326; caution toward, 41 (see 
“Zone of relative indifference’); empiri- 
cal example of freedom from item word- 
ing, 266-275; independence from gener- 
alized verbal intensity, 43—44, 261, 262; 
independence from item bias, 42, 216, 
266, 269-275; independence from sanm- 
pling of items, 216, 218, 275-276, 301, 
326; and informational items, 233-234; 
as located by median content rank for 
respondents with lowest intensity scores, 
217, 227, 229, 236, 239, 302; sampling 
error of, 41, 219; and size of sample, 283; 
and zone of relative indifference, 41n, 
229 

Zero range, see ‘Zone of relative indifle- 
ence” 

Zero reliability, 302, 309 

Zero tetrad, see “Tetrads of cross prod 
ucts” 

Zone of relative indifference, 41, 42, 43, 
226, 233, 236, 243, 276 

Z-value: for latent dichotomy, 399; for 
latent trichotomy, 399 


Tue Sociau Science Researcu Counci was or- 
ganized in 1923 and formally incorporated in 1924, 
composed of representatives chosen from the seven 
constituent societies and from time to time from re- 
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cil to plan, foster, promote, and develop research in 
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